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Preface 


A set theory textbook can cover a vast amount of material depending on the mathematical 
background of the readers it was designed for. Selecting the material for presentation in 
this book often came down to deciding how much detail should be provided when explaining 
concepts and what constitutes a reasonable logical gap which can be independently filled in 
by the reader. The initial chapters of this book will appeal to students who have little expe- 
rience in proving mathematical statements, while the last chapters, significantly denser in 
subject matter, will appeal considerably more to senior undergraduate or graduate students. 
Choice of topics and calibration of the level of communication is based on the estimated 
mathematical fluency of the target students. The first part of this book (Chapters 1 to 21) 
is written for intermediate level math major students in mind. At this level, most students 
have not yet been exposed to the mathematical rigor normally found in most textbooks 
in set theory. The pace at which new concepts are introduced at the beginning is what 
some may subjectively consider as being quite “leisurely”. The meaning of mathematical 
statements is explained at length and their proofs presented in great detail. As the student 
progresses through the course, he or she will develop a better understanding of what con- 
stitutes a correct mathematical proof. To help attain this objective, numerous examples of 
simple straightforward proofs are presented as models throughout the text. 

The subject material is subdivided into ten major parts. The first few are themselves 
subdivided into “bite-size” chapters. Smaller sections allow students to test their under- 
standing on fewer notions at a time. This will allow the instructor to better diagnose the 
understanding of those specific points which challenge the students the most, thus helping 
to eliminate obstacles which may slow down their progress later on. 

Each chapter is followed by a list of Concepts review type questions. These questions 
highlight for students the main ideas presented in that section and help them deepen un- 
derstanding of these concepts before attempting the exercises. The answers to all Concept 
review questions are in the main body of the text. Attempting to answer these questions will 
help the student discover essential notions which are often overlooked when first exposed 
to these ideas. 

Textbook examples will serve as solution models to most of the exercise questions at 
the end of each section. Exercise questions are divided into three groups: A, B and C. The 
answers to the group A questions normally follow immediately from definitions and theorem 
statements presented in the text. The group B questions require a deeper understanding 
of the concepts, while the group C questions allow the students to deduce by themselves a 
few consequences of theorem statements presented in the text. 

The course begins with an informal discussion of primitive concepts and a presentation 
of the ZFC axioms. We then discuss, in this order, operations on classes and sets, relations 
on classes and sets, functions, construction of numbers (beginning with the natural numbers 
followed by the rational numbers and real numbers), infinite sets, cardinal numbers and, 
finally, ordinal numbers. It is hoped that the reader will eventually perceive the ordinal 


numbers as a natural logical extension of the natural numbers and as the “spine of set 
theory” — like many authors described them. Towards the end of the book we present a 
brief discussion of a few more advanced topics such as the Well-ordering theorem, Zorn’s 
lemma (both proven to be equivalent forms of the Axiom of choice) as well as Martin’s 
axiom. Finally, we briefly discuss the Axiom of regularity and a few of its implication. A 
brief and very basic presentation of ordinal arithmetic properties is then given. 

The pace and level of abstraction increase considerably when ordinals are introduced. It 
is hoped that everything which is presented before this point will allow the students to mas- 
ter various proof techniques while simultaneously developing a feeling for what constitutes 
the essence of set theory. The format used in the book allows for some flexibility in how 
subject matter is presented, depending on the mathematical maturity of the audience or 
the pace at which the students can absorb new material. A determining factor may be the 
amount of practice that students require to understand and produce correct mathematical 
proofs. Some instructors may decide to use the first twenty chapters of the book as a text 
for an “Introduction to mathematical proofs” course. 


Students who already possess a substantial amount of mathematical background may 
feel they can comfortably skip many chapters without loss of continuity, since these contain 
notions which are well-known to them. The following order sequence will allow readers with 
the required background to advance more quickly to the meat of the textbook: Chapter 
1 on the topic of the ZFC-axioms can be immediately followed by chapters 13 and 14 on 
the topic of natural numbers, chapters 18 to 22 on the topic of infinite sets and cardinal 
numbers followed by chapters 26 to 29, 32 and 33, on ordinals, and finally, chapters 30 and 
31 on the axiom of choice and the axiom of regularity. 


As we all know, any textbook, when initially published, will contain some errors, some 
typographical, others in spelling or in formatting and, what is even more worrisome, some 
mathematical. Many readers of the text are required to help weed out the most glaring 
mistakes. If you happen to be a reader who has carefully studied a chapter or two of the 
book please feel free to communicate to me, by email, any errors you may have spotted, 
with your name and chapters reviewed. In the preface of further versions of the book, I will 
gladly acknowledge your help. This will be much appreciated by this writer as well as by 
future readers. It is always more pleasurable to study a book which is error-free. 


Robert André 
University of Waterloo, Ontario 
randreQuwaterloo.ca 


ISBN 978-0-9938485-0-6 
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Part I 


Axioms and classes 


Part I: Axioms and classes 1 


1 / Classes, sets and axioms. 


1.1 


Summary. In this section we discuss axiomatic systems in mathematics. We 
explain the notions of “primitive concepts” and “axioms”. We declare as prim- 
itive concepts of set theory the words “class”, “set” and “belong to”. These will 
be the only primitive concepts in our system. We then present and briefly dis- 
cuss the fundamental Zermelo-Fraenkel axioms of set theory. 


Contradictory statements. 


When expressed in a mathematical context, the word “statement” is viewed in a 
specific way. A mathematical statement is a declaration which can be characterized 
as being either true or false. By this we mean that if a statement is not false, then 
it must be true, and vice-versa. There exists a predetermined set of rigorous logical 
rules which can be used to help determine the true or false value of such statements. 
Whether one does mathematics as an expert or as a beginner, these elementary rules 
of logic must always be respected. An argument which does not respect one of these 
rules is said to be ”illogical”. Then, by combining various mathematical statements 
whose true or false values are known, we can logically determine the true or false value 
of other mathematical statements. A rule of logic looks something like this: 


If Q is true whenever P is true, and T is true whenever Q is true, then T is 
true whenever P is true. 


Such rules can be symbolically represented in a way that avoids the use of words. For 
example, the above statement is represented as: 


(P => Q)^A (Q >T) > (P >T) 


In this way, we can construct an elaborate system of mathematical statements each 
of which has been determined to be true or false. The logical steps which help us 
determine the true or false value of a statement is called a “mathematical proof”. 
Most readers have previously been exposed to this particular way of thinking in var- 
ious courses such as calculus and linear algebra. Basic rules of logic are normally 
not taught explicitly in such courses. It is however expected that a student who has 
sufficiently been exposed to rigorous mathematical arguments and has often enough 
attempted to formulate correct mathematical proofs — sometimes more successfully 
than others — progressively learns to distinguish valid logical arguments from ones 
that are flawed. Like learning to speak any language, formulating correct mathemat- 
ical arguments is a skill that is developed with practice. 


If the truth of a mathematical statement is logically deduced by combining statements 
previously known to be true, then clearly there had to be, at some point, a set of state- 
ments whose true-false values were not derived from previous statements. That is, 
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the process must start somewhere, with some initial statements whose true-false value 
were unknown. Such initial statements are not “deduced” but simply declared to be 
true based on nothing more than “common sense”. For example, one may declare the 
statement: “Distinct parallel lines cannot intersect” as being self-evident or being so 
“elementary” that it cannot be proved. Once we give ourselves a set & of self-evident 
statements and a list of rules that can be used to determine the true-false value of 
other statements then the universe Yy of all possible true statements derived from 
æ% is determined. This determined universe Yy of statements constitutes a mathe- 
matical theory which is ours to explore, or discover, one statement at a time. 


But what if the choice of our original set £ of statements was not a wise one? “How 
can it not be a wise one if based on common sense?” one might ask. Imagine this 
scenario: 


Say that from a set © of initial self-evident statements, a statement A has 
been shown to be true, and given that A is true it is deduced that statement 
B must be true, and from B we deduce that P is true. On the other hand 
it is shown that given A, statement D must be true and that from D we 
show that P is false. Hence, from A we have deduced that the statement P 
is both true and false. 


A statement which has been determined to be both true and false is referred to 
as a “contradictory statement” or a paradox. If a contradictory statement logically 
flows from what was assumed to be a paradox-free system, then the foundation of 
this system, as well as the methods used to determine the true or false value of 
statements, must be carefully scrutinized to determine the incorrect assumption(s) 
which allowed this “renegade” statement to emerge. In this book we will explore a 
specific mathematical system. It is hoped that in the process, the reader will be able 
to appreciate the skill and ingenuity required for the construction of this impressive 
mathematical structure. This system is called the “theory of sets” or more simply 
“set theory’. 


Sets. 


Most people are familiar with the notion of a set and its elements. “Sets” are viewed 
as collections of things while “elements” are viewed as those things which belong to 
sets. Normally, a set is defined in terms of certain properties shared by its elements. 
These properties must be well described, with no ambiguities, so that it is always 
clear whether a given element belongs to a given set or not. Being a “set” can also be 
an element property; so sets whose elements are sets exist. For example, the set S of 
all teams in a particular hockey league. The elements of the set S' are sets of hockey 
players. 
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Let us consider a few examples of entities we may consider to be sets. 


a) 


Let T denote the set of all straight lines in the Cartesian plane. For example, the 
set A = {(x,y) : y = 2x + 3} belongs to T while the set B = {1, 2,3} does not. 
We easily see that T is not an element of T since T is not a line in the Cartesian 
plane. 

Let U denote the set of all sets which contain infinitely many elements. This set 
is well-sdefined since we can easily distinguish those elements which belong to U 
from those that do not belong to U. For example, the subset {—2,0, 100} is not 
an element of U since it contains only three elements. We ask the question: Is the 
set U an element of U? To help answer this question, witness the sets 


Ao {0,1,2,3,---} 
Ay {—1,0,1,2,3,---} 
Ag = {—2, —1,0,1,2,3,---} 


An 


I 
ra 
3 
= 





1), -(n — 2),---,—2,-1,0, 1, 2,3,---} 


Every element in the set {Ao, A1, Ag, A2,...} belongs to U. Hence, U contains 
infinitely many elements. We conclude that U is an element of U. 

Define S to be the set of all “sets that are not elements of themselves”. For exam- 
ple, the set T described in example a) is in S since T is not an element of itself. 
The set U described in example b) does not belong to S' since U is an element of 
itself. 


We now look more closely at the three sets described above. Other than the fact 
that it is an extremely large set, there is nothing extravagant about the set T de- 
scribed in example a). On the other hand, the set U discussed in example b) also 
appears to be well-defined since a set which is infinite can easily be distinguished 
from one that is not. But the fact that this set is an element of itself makes one 
wonder whether we should allow sets to satisfy this property. On the other hand, 
it is difficult to express what could possibly go wrong with such sets. Let us now 
look closely at the “set” described in the example c): A set belongs to S only if 
it does not belong to itself. We wonder whether, like the U in example b), the 
set S is an element of itself. But S cannot belong to S since no element in S can 
belong to itself. So S is not an element of itself. Then, by definition of S, S would 
then be an element of S. This doesn’t make sense. There is obviously a problem 
with the “set” described in example c). Even if it was fairly easy to detect the 
contradiction which follows from example c) specifically determining the source of 
this contradiction can be more difficult. 
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Example c) nicely illustrates what is called a paradoz. As we mentioned earlier, a 
paradox consists of two contradictory statements both of which logically flow from 
what was thought to be a well understood and clearly defined concept. In this case, 
to say that the statement “S is an element of S” is true means that the statement “S 
is an element of S” is false, and vice-versa. For many, this is just a play on words; 
it may seem harmless enough. But for mathematicians this was not a trivial matter. 
The discovery of this particular paradox by Bertrand Russell was equivalent to the 
uncovering of a malicious virus lying dormant in the heart of the operating system of 
a microcomputer. It cannot be ignored. The way sets are defined along with their 
universally accepted properties form the foundation of modern mathematics. Para- 
doxes such as this undermines the confidence we have in mathematics, a discipline 
which prides itself on its clarity of thought, a discipline which sees itself as a seeker of 
irrefutable truths. Once this flaw was exposed it was important to understand why 
we did not see this before. Mathematicians also wondered whether any other cracks 
in the foundation of mathematics existed, requiring immediate attention. 


Paradoxes are usually the result of some “erroneous assumptions”. In example c), we 
are making an erroneous assumption of some kind. But it is not obvious what the 
erroneous assumption is. Should we allow ourselves to talk of a “set that contains 
itself as an element”? Or maybe the erroneous assumption is to say that there exists 
a “set that contains all sets”. How do we decide which assumptions are acceptable 
ones and which are not? This problem motivated mathematicians to determine as 
clearly as possible what are acceptable properties for “sets”. Since most of modern 
mathematics can be derived from the notion of “sets” this question was labeled “High 
priority” in the early 1900’s. It is in this period that particular attention was given to 
developing a reliable axiomatic system which could serve as a foundation of modern 
mathematics. The modern set-theoretic axiomatic system which evolved as a result 
of these efforts is the main topic of this book. 


Axiomatic systems. 


An axiomatic system is normally set up by first declaring some primitive concepts or 
undefined notions. These primitive concepts carry no intrinsic meaning although the 
symbols or words used to represent them often convey some intuitive concept in the 
mind of the reader. That is, the words which represent this undefined notion are such 
that the user will more easily understand the properties which will be prescribed for 
this concept. Specific rules and properties which declare how these concepts relate to 
each other are then formulated; these rules and properties must allow mathematical 
constructs which are viewed as being important in our mathematical system. These 
properties and rules are called the “axioms”. 


Euclid’s axiomatic system. Euclid provided us with a useful model for constructing 
an axiomatic system. He is the first person known to apply the axiomatic method to 
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study the field of geometry. In his axiomatic system, the words “line” and “point” 
are primitive concepts. He instinctively recognized that some undefined terms would 
be required. He then described properties of “lines” and “points”. These properties 
are his axioms. These axioms are statements whose true-false values are not logically 
deduced from statements previously shown to be true. They are simply assumed to 
be true. The important point here is that he explicitly states what these “assumed 
to be true properties” are. The proposed primitive concepts and these axioms, when 
gathered together, constitute the foundation of the “Euclidean axiomatic universe” 
more commonly referred to as Euclidean geometry. Euclid justified his choice of ax- 
ioms by saying that these point and line properties were “self-evident”. Note that 
Euclid’s primitive concepts and axioms differ entirely from the set-theoretic axioms 
we will be studying. But the axiomatic method he used to study geometry has served 
as a valuable model for others who wanted to develop different mathematical systems. 
Euclid then used deductive reasoning to show that various geometric statements were 
true. The assumptions made were limited to 


1) the stated axioms, along with 


2) other statements previously shown to be true. 


In this way, Euclidean geometry came to be. In spite of Euclid’s best efforts, careful 
scrutiny of his work revealed that Euclid erred in certain ways. He unknowingly made 
assumptions which were neither stated as axioms nor previously proven to be true. 
In 1899, the mathematician David Hilbert revised the Euclidean axiomatic system by 
proposing three primitive concepts: point, straight line, plane. He also proposed 21 
axioms. In 1902, one axiom was shown to be redundant and so was eliminated from 
the list. These primitive concepts along with 20 axioms are now widely accepted as 
forming a firm logical footing for Euclidean geometry. 


The Zermelo-Fraenkel axiomatic system. 


The axiomatic system of set theory as we know it today was in large part developed 
in the period of 1908 to 1922 by Ernst Zermelo and Abraham Fraenkel. Mathemati- 
cians T. Skolem and John Von Neumann made slight modifications to these a few 
years later. These axioms are now referred to as the ZF-arioms which stands for the 
“Zermelo-Fraenkel axiomatic system”. 


In what follows we will strive to develop an intuitive understanding of what axiomatic 
set theory is all about. We will avoid logical formalism, a treatment of axiomatic 
set theory based entirely on symbols normally reserved for more advanced courses. 
Our approach to set theory is referred to as “naive set theory” in the sense that we 
use ordinary language (words, sentences) to better understand what the basic ax- 
ioms actually mean. We will see how these axioms are used to construct well-known 
sets such as the natural numbers and the real numbers. Finally, we will see how and 
why the chosen axioms serve as a widely accepted foundation of modern mathematics. 
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Primitive concepts and notation. What makes a set of primitive concepts and axioms 
suitable for a particular theory? Most will agree that these must satisfy the following 
conditions: 


1) The number of undefined terms and axioms must be as few as possible. 


2) Normally, an axiom should not be logically deducible from other axioms. (If one 
is deducible from the others this should be explicitly expressed. ) 


ow 
a” 


We should be able to prove from these axioms and concepts most of what we 
consider to be interesting or useful mathematics. Often, parts of the logical 
universe which is determined from the axioms are preconceived, in the sense 
that axioms are introduced so that certain mathematical statements will turn 
out to be true. This, of course, can turn out to be a “dangerous game”. But 
there has to be some motivation for choosing one statement as an axiom rather 
than choosing another. 


4) These axioms must not lead to any paradoxes. An axiomatic system which con- 
tains contradictions is either modified to one in which these contradictions do 
not occur or some axioms are simply discarded and replaced with others if needed. 


The primitive concepts in our theory. There will be three undefined notions in our 
axiomatic system. They are the words: 


“class” 
“set” 
“belongs to” 


The expression “belongs to” is often stated as “is a member of” or “is an element 
of”; it is usually abbreviated by the symbol, €. 

All objects in our theory are classes. There is nothing else. We will soon distinguish 
special kinds of classes. Once we have discussed a few axioms, we will define a set 
as being a special kind of class. A class which is not a set will be called a proper 
class. Some axioms will help us distinguish between those classes which are sets and 
those which are proper classes. Classes will be represented either by lower-case or 
upper-case letters. So we can write “Let x and A be two classes”. 


The expression 
rEA 


is to be read as “the class x belongs to the class A”, or “the class x is in the class 
A” or “x is an element of A”. However, no class will be representable by a lower-case 
letter, x, unless it is known that x € B for some class B. Those classes which can be 
represented by a lower-case letter, say x, will be given a special name: 
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If a class A is such that A € B for some class B, then we will refer to the 
class A as being an “element”. 


Elements are still classes; but they are special classes, since they “belong to” another 
class. So an element can be represented by both a lower-case or an upper-case letter. 
For example, if we write x € y or A € B this means that x, y and A are elements, 
while B may or may not be an element. 


Why is “element” not an undefined notion? The reader may find surprising that the 
object element is not expressed as an undefined notion. After all, we are accustomed to 
distinguishing elements from sets. Introducing a fourth undefined term was eventually 
seen as being superfluous. This became clear when we realized that we often view 
sets as being “elements” of other sets.! Witness: 


- Points (a,b) in the Cartesian plane are actually two-element sets {a,b} of real 
numbers stated in a particular order. 


- Rational numbers a/b can be described as the set of all two-element sets {a, b} 
of integers in a particular order where b is not 0. 


- Irrational numbers can be viewed as infinite sequences of rational numbers con- 
verging to a non-rational number, again a set. 


The axioms of set theory. 


We now give a “preview” of what the set-theoretic axioms are, keeping in mind that a 
full understanding of what they mean will only be developed when we actually invoke 
each of these in various situations where they are required. These are called the ZF- 
axioms. The reader will see how surprisingly few are required. At this point, much of 
this will look like gibberish, but as we prod through the subject matter, we will step 
by step develop a better understanding of what they mean. 


Primitive concepts: “class”, “set” and “belongs to”. 


Axiom A1 (Axiom of extent): For the classes z, A and B, [A= B] = |r € A & x €B] 


Axiom A2 (Axiom of class construction): Let P(x) designate a statement about x which 
can be expressed entirely in terms of the symbols €, V, A, =, >, V, brackets and variables 


T, 


Y, Z,..., A, B, ... Then there exists a class C which consists of all the elements x 


which satisfy P(x). 
Axiom A3 (Axiom of pair): If A and B are sets, then the doubleton {A, B} is a set. 


1There exists a branch of set theory in which mathematical entities which are neither sets nor classes are 
considered. These are referred to as “urelements”. We will not consider these in this text. 
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Axiom A4 (Axiom of subsets): If S is a set and @ is a formula describing a particular 
property, then the class of all sets in S which satisfy this property ¢ is a set. More 
succinctly, every subclass of a set of sets is a set.! 

Axiom A5 (Axiom of power set): If A is a set, then the power set Z (A) is a set. 

Axiom A6 (Axiom of union): If æ is a set of sets, then Ugey C is a set. 

Axiom A7 (Axiom of replacement): Let A be a set. Let (x,y) be a formula which asso- 
ciates to each element x of A an element y in such a way that whenever both ¢(z, y) 
and (x, z) hold true, y = z. Then there exists a set B which contains all elements y 
such that (x,y) holds true for some x € A.? 

Axiom A8 (Axiom of infinity): There exists a non-empty class A called a set that satisfies 
the condition: “X € A” > “X U{X} e€ A”. (A set satisfying this condition is called a 
successor set or an inductive set.) 

Axiom A9 (Axiom of regularity): Every non-empty set A contains an element x whose 
intersection with A is empty. 


Another “special” axiom is usually stated separately from the other nine axioms above. 
It is viewed by many as being different in nature. It was also, at least initially, quite 
controversial. It is called the Axiom of choice. 


Axiom of choice: For every set æ of non-empty sets there is a function f which associates 
to every set A in æ% an element a € A. 


In this text we will refer to this set of nine axioms viewed together with the Axiom of 
choice as “ZF + Choice” or simply by ZFC.? 


The Axiom of choice. The controversy surrounding the Axiom of choice requires 
some explanation. The Axiom of choice is an axiom which was added after most of 
the ZF-axioms were widely accepted as a foundation for modern mathematics. It is 
so subtle a concept that many early mathematicians unknowingly invoked it in their 
proofs. That is, it was invoked without stating it explicitly as an assumption. Some 


1This axiom is more often expressed as the Axiom of comprehension, Axiom of Specification or Axiom 
of separation. It is in fact many axioms (which, when viewed together, are referred to as a schema) each 
differing only by the formula ¢ it refers to. So to be more precise, given a formula ¢ in set theory language, 
we would refer to it as axiom A4(@) rather than A4. 

?This axiom is more often expressed as the Replacement axiom schema since it is in fact many axioms 
each differing only by the formula ¢ it refers to. So to be more precise, given a formula ¢ in set theory 
language, we would refer to it as axiom A7(¢) rather than A7. It essentially allows us to confirm that if the 
domain A of a set f is a set, then the image f[A] is a set. 

3Note that some of the ZF axioms listed have been shown to follow from the others. So some set theory 
texts may omit one or more of these from their formal list of ZFC axioms. Since most of these axioms are 
non-controversial we will adopt, for this text, this list of 10 axioms as the ZFC-axioms. The reader should 
simply be alerted to the fact that the list of the ZFC axioms may vary from text to text. 
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mathematicians publicly questioned this assumption, asking openly whether the word 
“obviously” was sufficient justification for using it. These questions could not be 
ignored. Numerous attempts at proving this axiom from the ZF-axioms failed. In 
1963, it was finally proven that neither the Axiom of choice, nor its negation, can be 
proven from the ZF-axioms. This implied that we are free to state it as an axiom, 
along with the other ZF-axioms, without fear of producing a contradiction. A lengthy 
debate on whether this statement should be included with the other “fundamental” 
ZF-axioms followed. Some described it as “the most interesting and, in spite of its late 
appearance, the most discussed axiom of mathematics, second only to Euclid’s axiom 
of parallels which was introduced more than two thousand years ago” (Fraenkel, Bar- 
Hillel & Levy 1973). Eventually, it was felt that “not accepting” the Axiom of choice 
closes the door to many fundamentally important results of modern mathematics. 
One could say that the Axiom of choice had already been used so extensively that it 
was deeply ingrained in the modern mathematical fabric; we were “addicted” to the 
Axiom of choice, so to speak. 


Even though proofs that invoke the Axiom of choice are widely viewed as being ac- 
ceptable, it is often felt that a correct proof that does not invoke the Axiom of choice 
is preferable to a simpler proof which invokes it. This is because it assumes the exis- 
tence of something that can neither be seen nor constructed. It is viewed somewhat 
pejoratively by some as the “magic wand” that magically opens closed doors. For this 
reason, when proving a statement, it is customary to point out explicitly the steps 
where the Axiom of choice is invoked. Actually, there is a general consensus on one 
point: The Axiom of choice should not be listed with the ZF-axioms; it should be 
set apart in a category of its own. This is why we refer to this group of axioms as 
“ZF +Choice”, or simply ZFC. One can view this as some sort of compromise. 


A few more words on these axioms. 


Even though the words class and set are undefined, the axioms will allow us to perceive 
them (once we can decode them) as “collections of objects”. It is still too early to 
extract the full meaning of the axioms stated above. But the reader will feel more at 
ease if we interpret at least certain aspects of these immediately. 


The axiom A1, 
“For the classes z, A and B, [A = B] & [z € AS z € BI.” 


is an axiom which states that “a class is defined by its elements”. If two classes are 
equal, then they have the same elements. Conversely, two classes which have the same 
elements are the same class. 


We now examine more closely the axioms A2, A9 and the Axiom of choice, in random 
order. 


Axiom A2: (Class construction) 
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“Let P(x) designate a statement about x which can be expressed entirely in 


terms of the symbols €, V, A, =, >, V, brackets and variables x, y, z,..., A, B, 
... Then there exists a class C which consists of all the elements x which satisfy 
Pa 


states that we can use well-defined properties which can be expressed by the given 
symbols to construct classes. For example A= {X :u € X} and B=4{X:X =X} 
are different classes since the properties that characterize their elements are different. 
For the class A, P(x) is the property u € X while for the class B, P(x) is the property 
Ne Se 


Axiom A8 (Infinity): 


“There exists a non-empty class A called a set that satisfies the condition: 
“X € A” => “X U{X}e€ A”.” 


says that there exists a class called a set which is infinite in size. (This axiom also 
guarantees that at least one class called a set exists.) It essentially allows us to define 
the “natural numbers”, 0,1, 2,3,...,. 


Axiom of choice: 


“For every set & of non-empty sets there is a rule f which associates to every 
set A in æ an element a € A.” 


says that given a set of non-empty sets, there exists a certain type of function. But it 
does not show how to construct or find such a function. 


Note that axioms Al and A2 refer only to classes while all the other axioms (Axioms 
A3 to A9 and the Axiom of choice) are “set axioms”. The set axioms determine what 
kind of objects exist in the universe of all sets. 


Axioms A2, A3, A4, A5, A6 and A7 are “constructive” axioms since A2 gives us a way 
to construct a class by referring to a property. Axioms A3 to A7 provide a method to 
construct new sets from ones that are known to exist. 


Axiom A9, the Axiom of regularity, is sometimes referred to as the “useless” axiom 
by some. Others don’t consider it as a basic axiom since most of mathematics which 
is based on set theory does not require it. It will be invoked only in the last chapter of 
this book. Although it is not obvious, just from reading it, this axiom actually states 
that “those non-empty classes which don’t have a least element are not sets”. It is 
in fact an axiom which does not allow certain types of sets to exist in the universe of 
sets. It is of an exclusionary nature. The other axioms (except for axiom A1) increase 
the number of sets in the universe of sets. 
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The Axioms A4 (Subsets) and A7 (Replacement) each represent many axioms. We 
refer to such axioms as schema.! Axiom A4 speaks of a set S and a particular formula 
ġ describing a property. For each property we have a different Axiom. Given ¢, we 
could say the “Axiom A4 for @”. Axiom AT speaks of a set A and a class B of sets 
along with a particular formula (x,y) which plays the role of a function (normally 
referred to as a functional). For each functional, ¢(x, y), we have a different axiom. 


1.7 Some things we may immediately wonder about. 


As one may suspect, when formally expressed, axioms do not contain words. The 
ZFC-axioms are expressed using symbolism of first order logic. For example, when 
formally stated, an axiom may look like this: 





VaVydz(x2 € zVy € 2) 


This is the Axiom of pair. We use the words and sentences to develop an intuitive 
understanding of what this code means. 


A second point one may wonder about: Are the ZF-axioms consistent? That is, do 
we know for sure that the ZF-axioms, as stated, will never yield some contradiction? 
If one day we actually encounter a contradiction that flows from these axioms, then 
we can answer: “No, the ZF-axioms are not consistent, since we have revealed a 
contradiction which flows from these axioms!” If such a contradiction is discovered 
we must tinker some more with the set-theoretic axioms to correct the flaw. 


But as long as we do not encounter such a paradox, the answer to this question is: 
“We don’t know for sure whether the ZF-axioms are consistent.” It has been shown 
that using only the ZF-axioms, it is impossible to prove or disprove that the ZF- 
axioms are consistent. It is the “nature of the beast”, so to speak. Since new forms of 
mathematics is uncovered every day, it is possible that next week, in a hundred years 
or in a thousand years someone will discover that ZF is inconsistent. “Set theory” 
is, as the words indicate, just a theory. By their very nature, all theories evolve to 
explain newly discovered previously unknown facts. The ZF -set-theoretic system is 
no different. As a foundation of modern mathematics, the ZF-set-theoretic system 
seems to serve its purpose well; it is the best theory we have today, even though some 
day we may discover significant ways of improving it. 


Concepts review: 


1. What is Russell’s paradox? 


2. Why do paradoxes occur? 


1A dictionary describes schema as meaning “an underlying organizational pattern or structure”. 
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. What are three primitive concepts of set theory? 

. What is the difference between a class, a set and a proper class? 
. When is a class called an element? 

. Which classes can be represented by a lower case letter? 

. What does ZFC stand for? 


. How many axioms belong to ZFC? 
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2 / Constructing classes and sets. 


Ki 


Summary. In this section we define the symbols “=” and “C” and discuss Az- 
ioms Al to A5. We show how Axioms Al and A2 are used to construct classes. 
Axioms A3 to Ad are used as tools to construct sets. We distinguish between 
classes, sets and elements by exhibiting a class which is not an element and a 
class which is not a set. We also show that all sets are elements. We introduce 
the concept of “power set of a set” as a set constructing tool. 


2.1 Basic statements, definitions and notation. 


To discuss the axioms and some of their immediate consequences we first define a few 
words and symbols that will allow us to communicate certain ideas more efficiently. 
We first confirm that every class is equal to itself. This is not an axiom, since it is an 
immediate consequence of axiom A1. 


Theorem 2.1 For any class C, C=C. 


This follows from axiom Al: Since z € C > zx € C and z E€ C > x E€ C then C=C. 


If the statement “A = B” is false then we will write A £ B. 


Definition 2.2 If A and B are classes or sets we define A C B to mean that every element 
of A is an element of B. That is, 


AC Bifand only if re A>xreB 


If A C B we will say that A is a subclass (subset) of B. If A C B and A # B we will say 
that A is a proper subclass (proper subset) of B; in this case we write A C B. So “A C B” 
is a shorter way of saying “A C B but A # B”. 


We restate the Axiom of construction: 
Axiom A2: If P(x) is a property of an element x which can be expressed 
entirely in terms of the symbols €, V, A, =, >, V, brackets and variables z, y, 
z,.., A, B, ..., then there exists a class C which consists of all the elements 
x which satisfy P(x). 


'Do not confuse C with €. When we say that “the class A belongs to the class B” we mean that A € B, 
not AC B. 
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This axiom refers to properties of elements which can be expressed in terms of logical 
symbols. Many students may have used some or all of these symbols before; for 
completeness, we explicitly state how these symbols should be interpreted: 

€ is given the meaning “is an element of” 
V is given the meaning “or” 
A 


is given the meaning “and” 


J 


is given the meaning “not” 


= is given the meaning “implies” 





4 is given the meaning “there exists” 


YV is given the meaning “for all” 


The Axiom of construction allows us to construct a class by first defining a property 
P and then gathering together all the elements possessing this property to form the 
class 

C={A: A possesses the property P} = {x : P(x)} 


This class is succinctly expressed as: {x : P(a)}. The brackets { } refer to a “class”. 
The symbol, P(x), means “x possesses property P”. The lower case symbol, x, refers 
to a “class which is an element of another class”. Elements are the only classes that 
can be denoted by a lower case letter. 


A word of caution: Axiom A2 allows us to gather together all the “elements” that 
possess a property P, not all “classes” that possess a property P. For if the word 
“element” is replaced with the word “class”, then we easily obtain a paradox. Witness 
the class, C, defined as follows: 


C={A: Aisa class and A satisfies the property =(A € A)} ={A:A¢ A} 


which leads to Russell’s paradox, since neither C ¢ C 4 C e C is both true and 
absurd. 


Also, in expressions such as 
“the class {C : C = C}” 


it is understood that C must be an element. The expression {x : x = x}, means the 
same thing except it emphasizes that the classes it refers to are elements. 


Properties of classes. 


Axiom A1 allows classes to have the properties normally attributed to those things 
we call “collections of objects”. After all, this is what we would like classes and sets 
to be. The axioms are developed with a preconceived idea of what the only objects 
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(classes and sets) in our set-theoretic universe are. The statements in the following 
theorem are all logical consequences of these axioms. They are easily seen to hold 
true, but it is good practice to explicitly write out the proofs. 


Theorem 2.3 If C, D, and E are classes (sets) then: 


a) C=D>D=C. 


CC DandDCCSC=D. 


) 
b) C= PD and p= b= CSF. 
c) 

) 


d) CC DandDCESCCE. 


Proof: 


a) Suppose C = D. Then by axiom A1, xz € C = x € D andz € D = zx € C. Then 
xe D =zx€EcC and ze C = zE nD. Hence, by definition of equality D = C. 


Proofs of b) to d) are left as an exercise. 


We said that all objects in our set-theoretic universe are classes. Some of those classes 
are elements provided these belong to another class. It is normal to ask whether there 
exists at least one class which is not an element. We answer this question in the 
following theorem. 


Theorem 2.4 There exists a class which is not an element. 


Proof: 
Let C be the class 
C = {A : A is an element and A satisfies (A € A)} = {x : x € x} 


By Axiom A2, the class Č is well-defined. Suppose C is an element. Then either C 
belongs to {x : x x} or it doesn’t. Both of these options lead to a contradiction. Then 
C is not an element, as required. 


2.3 The universal class and the empty class. 


The class Y = {x : x = x} is easily seen to be the class that contains precisely all 
elements. By theorem 2.3 part a), “Every element is equal to itself”. The class Y is 
called the universal class. In the proof of the theorem 2.4 we constructed a class C 
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which does not belong to Y. Hence, Y does not contain all classes. 


At this point, we have discussed only two of the primitive concepts: class and “belongs 
to”. From these we have defined “element”. The reader has maybe noticed that the 
word “set” has remained on the sidelines. We have not yet discussed this primitive 
concept, other than witnessing the very large set given to us “for free” by the Axiom 
of infinity. The Axiom A2 will allow us to construct a much smaller set. We start 
with the following definition. 


Definition 2.5 The Axiom A2 authorizes us to call C = {x : x # x} a class. Since we 
have proven above that every element is equal to itself, then this class contains no elements. 
We will call the class with no elements the empty class and denote it by Ø. 


Theorem 2.6 For any class C, Ø C C. 
Proof: 


Let C be a class. To show that Ø C C it suffices to show, by definition of “C”, that 
x E€ Ø => x EC. Any element in Ø belongs to C since Ø contains no elements; then 
Ø CC, as required.! 


2.4 Sets which are derived from other sets. 


Axiom Al “If z = y and x € A, then y € A” is a statement about elements x, y and 
the class A. Since every set is a class, axioms referring to classes also refer to sets. 
Axiom A2 is a statement which shows how to construct classes by referring to some 
property, P(x); it does not refer to properties specific to sets only. On the other hand, 
the axioms three and four are set-specific: 


Axiom A3 (Axiom of pair): If A and B are sets, then the doubleton class 
C = {A, B} is a set. 
Axiom A4 (Axiom of subsets): Every subclass of a set is a set. 


These two axioms alone will allow us to construct sets from those classes known to be 
“sets”. Axiom A8 guarantees that at least one class called “set” exists: It contains 
the words “...there exists a class A called a set that...”. We need not search any further. 


The axiom A3 refers to the set C = {A, B} as a “doubleton”. We will use the word 
doubleton when referring to two sets A and B viewed together to form a collection 
{A, B} of sets. For convenience, we will not put any restrictions on how the set B 
relates to A. For example, we can refer to the set {A, A} as a doubleton even though 
it contains only one element. 


The statements in the following theorem follow immediately from the axioms A3 and 
A4. 


1 Alternatively, the statement « ¢ C => x ¢ Ø is the logical equivalent of x € Ø > x € C. If x is not an 
element of C, then x is not an element of Ø since @ has no elements. 
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Theorem 2.7 Let S be a set. Then: 
a) Ø C S and so Ø is a set. 


b) The set S is an element. Hence, “All sets are elements”. 


Proof: 


2.5 


a) We are given that S is a set. We are required to prove that @ is a set. We can 


directly apply axiom A4: Since Ø = {x € S: x # x} and, by hypothesis, S is a set 
then, by A4, @ is a set as required. 


b) We are given that S is a set. We are required to prove that S is an element. By 


axiom A3 (Axiom of pair), for any set S, {5,5} is a set. Since S € {S,S}, for all 
sets S, then, by definition, S is an element, as required. 


In the proof above we discussed the set {5, S} which contains the set S as an element. 
Since {S,S} = {S} (Prove this!) this is a one element class. We call such sets single- 
ton sets. The reader should note that according to our definition of doubleton above, 
every singleton set {A} can be expressed as a doubleton {A, A}. 


We can now verify that the universal class Y = {x : x = x} is not a set. Suppose 
WU isa set. See that the class C = {x € Y: x Z x} is a subclass of Y. Since we 
assumed WY to be a set, by the Axiom of subset, C must also be a set. But we showed 
in theorem 2.4 that the class C is not an element and so cannot be a set. We have a 
contradiction. Therefore the universal class is a proper class, as claimed. 


Examples of sets which are non-empty. 


At this point we have only exhibited one set, the empty set @. In the following 
examples we we use some axioms to construct other sets. 


a) The set Ø contains no elements. By axiom A3, the class C = {9,2} = {Ø} isa 
set which contains exactly one element (the element @). Observe that Ø 4 {Ø} 
since {@} contains one element while @ does not. 

b) Let A = Ø and B = {Ø}. By axiom A3, C = {@,{@}} is a set which contains 
exactly two elements (the element Ø and the one element set {@}). 

c) Let a,b,c be 3 sets. Then, by repeated applications of axiom A3, 

{a, {a}, {{a}}, {a, b, c}} is a 4 element set. 

d) Let c be an element (class). Then A = {c} is a class with only one element since 
A = {x : x = c} and so, by axiom A2, A = {c} is a class. If c is known to be a set, 
A = {c} = {c,c}, so we can conclude that A is a set. 
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The class of all sets. 


Recall that the word setis a primitive concept (along with the word class). We define 
the property symbol “set(x)” to mean “zx is a set”. Then by axiom A2, Z = {zx : 
set(x)} = “all elements which are sets” forms a class. Since every set is an element 
(by theorem 2.7), we can say Z = {x : set(x)} is a subclass of the universal class, 
U ={x:x =x}. We call Z the class of all sets. From this we observe that: 


Given any property P, 
S = {zx : set(x) A P(x)} = {x: x is a set and z satisfies P} 
is a class. 


Axiom A2 said that {x : P(x)} = “all elements which satisfy property P” is a class. 
Now it makes sense to talk about the “class of sets satisfying a property P”. 


Note that the class, Z, of all sets is a proper class. To see this, suppose -Y was a 
set. Let D = {x € Z :x ¢ x}. The class D cannot be a set, for if it was, then as 
previously shown, we would quickly obtain the contradiction, D € D and D ¢ D. See 
that D is a subclass of Z. Since .Y was assumed to be a set, by the Axiom of subset, 
D must be a set. We have a contradiction. The source of the contradiction is our 
assumption that Z is a set. So -Y is a proper class. 


Power sets. 


Axiom A3 allows us to construct new sets from known ones by forming doubleton sets, 
while Axiom A4 allows us to construct sets by taking subclasses of sets and calling 
them subsets. Axiom A5 will allow us to construct, from a known set A, what seems 
to be a larger set, A(A). It is called the power set of A. We define “power set”. 


Definition 2.8 If A is a set, then we define the power set of A as being the class Y(A) of 
all subsets of A. It can be described as follows: 


P(A) ={X:XC A} 


We verify the following facts: 


— By axiom A4, “X C A” => “X is a set” so all elements of A(A) are sets. 


— By axiom A2, A(A) is a class. (The fact that “all sets are elements” is proved 
above). 


No one has been able to prove that A(A) is a set. So if we want it to be a set, we 
must postulate this fact. Axiom A5 does precisely that. We will later see why this 
axiom plays a fundamental role in the mathematical universe we are exploring today. 
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Axiom A5: If A is a set, then the power set, A(A), is a set. 


In formal language, the Axiom of power set actually reads as follows: 





VAJ3P|B € P= BCA 


where A is specified to be a set. This expression includes the definition of the “power 
set Y(A) of a set A”. The interior of the square brackets specifies that “the elements 
B of the power set Y(A) are precisely the subsets B of A”. The sequence of symbols 
“YAP” instructs the reader that given any set A, the class Y(A) exists as a set. This 
axiom also guarantees that a set A is an element since A belongs to the set Y(A). 
Declaring that the power set of any set is a set adds many sets to our universe of 
sets. There is, of course, some risk in doing this, since we may be allowing sets in our 
universe of sets which are so strange that we will not be sure whether we want them 
there or not. On the other hand, we will see that the Power set axiom is extremely 
useful for constructing sets we need. For example, the Power set axiom allows us to 
construct Cartesian products. 





The four axioms A6, A7, A8 and A9 have not yet been discussed. We will study these 
only when we require them later on. 


2.8 Examples. 


We provide a few exercises which allow us to practice notions related to power sets. 


1) Power sets. List the elements of the power set of 
a) the empty set, Ø. 
b) a singleton set. 
c) a doubleton set. 


Solution: 


a) The power set of the empty set: A(@) = {X : X C Ø}. If X € A(O), then 
X C Ø. Thus, X = Ø. So 
P (D) = {8} 
b) The power set of a singleton set {x}: For an element xz, A({x}) = {Ø,{£}}. 
Note that x Z {x} since the elements of x are not in {x}, a single element set. 
c) The power set of a doubleton set {x, y}: For the elements x and y, A({z, y}) = 


{S, {x}, {y} {v, y}}- 


2) Consider the 3-element class C = {x, {x, y}, {z}}. Determine which of the following 
statements are true and which are false. 


a) We can write x € C. 
b) We can write x C C. 
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c) We can write {x} C P(C). 
d) We can write {{z}} E€ P(C). 
e) We can write z € A(C). 

f) We can write {z} CC. 


Solution: 
a) True. We can write x € C since x is an element explicitly listed as a class in 
C. 


b) False. We cannot write x C C since this does not satisfy the definition of C. 
To write x C C is to say that every element in x is an element in C. But the 
contents of x is unknown. So there is no basis to state that x C C. 


c) True. We can write {x} C A(C) since every element in {x} is also an element 
of C. 

d) True. We can write {{z}} € A(C) since {{z}} C C. The only element in 
{{z}} is in C. 

e) False. We cannot write z € A(C) since the element z does not appear as an 
element of C. 


f) False. We cannot write {z} C A(C) since {z} contains only one element z. 
This element is not a subset of C. 


It was shown above that “all sets satisfying a property P” is a class. In the fol- 
lowing example we say something similar. But there are subtle differences in the 
statement. See if you can detect these differences. 


Let A be a set and P denote some property. Show that the class 
S = {x : (x C AAP at 


is a set. 
Solution: 


We are given that A is a set and S = {x : (x C A) A P(x)}. We are required to 
show that S is a set. Since A is a set, and, for every x € S, x C A then every x € S 
is a set (by axiom A4). By axiom A5, A(A) is a set. Since the class S C (A), 
then S is a set (by axiom A4). This is what we were required to prove. 


Concepts review: 


1. What does it mean to say “the class A is equal to the class B”, A = B? 
2. What does it mean to say “the class A is contained in the class B”, A C B? 


3. What does it mean to say the class A is a proper subclass of the class B? 
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10. 


11. 


12. 


How should we read the expression C = {x : P(x)}? 


. Is it true that @ Z Ø? Why? 

. State a class that is not an element. 
. What is the universal class? 

. What is the empty class Ø? 


. Is a set an element? 


Given a set A, what is the power set, A(A), of A? How do we know that (A) isa 
set? 


If B is a set and A C B, what can we say about A? Why? 


Why is Ø a set? 


EXERCISES 


1. Suppose A is a proper class and A C B. Show that B is a proper class. 


. Prove the following. 


a) {c, d,e} = {c, d} if and only if e = c or e = d. 
b) c= {d} >dEc 


. If x = {u,v}, y = {v, {w}} and z = {x,y} write out explicitly the elements of the 


following classes: A(x), P (y), Z(P(x)) and A(z) 


. Prove parts c), d) and e) of theorem 2.3. 
5. For sets S and T show that: 
a) S CT if and only if A(S) C A(T) 

b) S =T if and only if A(S) = A(T) 
. Show all the elements in the set A(A(@)). 
. Show all the elements in the set a 
. Show that [Sc T)|A[TCV)] = (S ) 
. Show that [S C T) ^A [T c V)| => ee CV). 


OO CON D 


12. 
13. 
14. 


. Show that c = d if and only if {c} = {d}. 
11. 


Show that c € d if and only if {c} C d. 


Show that (S C Ø) > (S = Ø). 
Using the empty set, @, construct a set containing 7 elements. 
If A= {2,9,9,{9,@}} and B = {@, {@}} show that A = B. 
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15. 
16. 


17. 
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Show that the statement “For any set S, A(S) C S” is a false statement. 
Suppose U and V are sets. Determine whether the statement A(U)U A(V) 
PZ (U U V) is true or false. Justify your answer. 


Suppose U and V are sets. Determine whether the statement A(U)N A(V) = 
P (U N V) is true or false. Justify your answer. 


Part II 


Class operations 
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3 / Operations on classes and sets. 


Summary. In this section we define operations on classes that will allow us 
to better see how classes relate to each other. The concept of unions, intersec- 
tions and complements of classes and sets are defined. The axiom A6 (Axiom 
of union) is discussed. This axiom is used to determine when the union of sets 
is a set. We show how Venn diagrams can serve as a guide when interpreting 
operations on sets. Some basic laws for the union, intersection and complements 
of classes and sets are presented in the form of theorems. De Morgan’s laws are 
also stated. 


3.1 Unions, intersections and complements of classes. 


The union and intersection of classes is a method for constructing new classes and 
sets from old ones. We begin by defining these formally. 


Definition 3.1 Let A and B be classes (sets). We define the union, AU B, of the class A 
and the class B as 
AUB={a: (x € A)V (xe B)} 


That is, x € AU B if and only if x € Aor x € B. If æ is a non-empty class of classes then 
we define the union of all classes in & as 


U C = {x : x € C for some C € #} 
CEA 


That is, x € Uccy C if and only if there exists C € & such that x € C. 


Definition 3.2 Let A and B be two classes (sets). We define the intersection, AN B, of 
the class A and the class B as 


ANB={x:(xE€ A)A(xzE€B)} 


That is, x € AN B if and only if x € A and z € B. If æ is a non-empty class of classes, 
then we define the intersection of all classes in % as 


() C={r:zEC Y Cem} 
CEA 


That is, £ € cesy C if and only if x € C for every class C in æ. 


Observe that 
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a) if £ = {D, E} then DU E = Ucew C. 
b) if £ = {D, E} then DN E = fNceg C- 


Also see that the axiom A2, Axiom of construction, guarantees that both Ugey C 
and []cey C are classes. If the class contains no elements, then by definition of 
“union” and “intersection” the union and intersection of all elements in & is Ø. 


Definition 3.3 We will say that two classes (sets) C and D are disjoint if the two classes 
have no elements in common. That is, the classes C and D are disjoint if and only if 
CAD= Ø. 


Definition 3.4 The complement, C’, of a class (set) C consists of all elements which do 
not belong to C. That is, if C is a class, then 


C'={xz:xg C} 


Hence, x € C’ if and only if x ¢ C. Again, the axiom of construction (A2) guarantees that 
C’ is a class. Given two classes (sets) C and D, the difference C — D, of C and D, is defined 
as 

C-D=CnD 


This is also a class. The symmetric difference, CAD, is defined as (the class) 


CAD =(C-— D)U(D-C) 


3.2 Unions and intersections referring specifically to sets. 


Observe that ocyC © C for all C € & since, by definition, every element in 
Mce C belongs to every C E€ &. 


— If æ is a non-empty class of sets then [ce C is a subclass of every set C. So, 
by axiom A4, ()cey C is a set. 

— Hence, we can say, in a general way, that the intersection of arbitrarily large classes 
A of sets is always a set. 


If & is a non-empty class of sets is [Joey C necessarily a set? What about the special 
case where & is a set of sets? None of the axioms Al to A5, nor any previously proven 
statement resulting from these help to answer this question. It will turn out to be 
useful if we can answer “yes” to the second question. However, we have been unable 
to prove this. Then, we need an axiom that will postulate this to be true. The axiom 
A6, Axiom of union, declares when a union of sets is a set. We restate it here: 
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3.3 


Axiom 6: If æ is a non-empty set of sets then Uccy C is a set. 


Thus, axiom A6 says “The union of all sets in a set of sets is a set”. We should also be 
clear about what axiom A6 does not say: “The union of all sets in a class of sets is a 
set.” If we make the mistake of assuming this to be true it will lead to a contradiction, 
as the following example shows. 


Example: Suppose & is the class % = {x : xisaset and« Z x}. Show that 
D = [J ey 7 is not a set. 


Solution: 


What is given: # = {x : x is a set and z ¢ x} 

What we are required to show: D = (U eg 7 is not a set. 

Suppose D = Uey is a set. Then by axiom A5, A(D) is also a set. We know 
that for every x € A, x C D = (Jregy x- (Make sure you see why. If not look at 
AC AUB.) So, for every z E€ Æ, x E€ P(D). Hence, w C A(D). 

Since æ is a subclass of a set, then, by axiom A4, æ is a set. 

We now argue as in theorem 2.4: Since æ is a set, “S ¢ A” => “A © Æ” and 
“A E A” = “A ¢ a£”. This is a contradiction. So D = U,¢, z cannot be a set. 
This is what we were required to show. 


Then the statement “The union of all sets in a class of sets is a set” is not a true 
statement, in general. Even though, in set theory, both class and set intuitively rep- 
resent a “collection of objects”, freely substituting the word set with the word class 
may lead to some nasty consequences. 


Venn diagrams. 


We often use Venn diagrams as a tool to visualize how sets relate to others. Venn 
diagrams should not be substitutes for proofs of statement; but they are helpful when 
used to guide our intuition. We represent here Venn diagrams representing some re- 
lations defined above. 


A B A B 


ANB AUB 


Figure 1: Intersection and union of two sets 
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A B A B 


A-B AAB 


Figure 2: Difference and symmetric difference 


A B 


C AN(BUC) 


Figure 3: Intersection distributing over a union. 


3.4 Basic laws for operations on classes and sets. 


We now prove a few fundamental properties of sets. The proofs of less common 
properties will be left as exercises. 


Theorem 3.5 Let C and D be classes (sets). Then 


a) CCCUD 
b) CNDCC 


Proof: 
a) Let x € C. It suffices to show that x € CUD. 


rEeC => ré€{e:xrE€Corzre D} 
> reCUD 


b) The proof is left as an exercise. 


Theorem 3.6 Let C and D be classes (sets). Then 
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Proof: 


a) What we are given: C and D are classes (sets). 
What we are required to show: C U (C N D) =C 


C 
CAD 


C U (C N D) (By theorem 3.5 a)) 


C 
C (C N D) U C (By theorem 3.5 a)) 


rE(CND)VC > zrEeECNANDorrzreEC 


> tE C or ze C (Since theorem 3.5 b) says CN D C C) 


> r£EC 
=> (CAD)UCCC 
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So C G CU (CA D) and (CN D) U C G C implies CU (CN D) = C (Def. of equal classes). 


b) The proof is left as an exercise. 


Theorem 3.7 Let C be a class (a set). Then (C'Y = C. 


Proof: 
xe(C' = «€C" (By definition 3.4) 
=> x£EC (By definition 3.4) 
> (C1) OC 


rEC = «¢€C" (By definition 3.4) 
=> x€(C’)' (By definition 3.4) 


= (OGG) 


mnm 
Q 
IQ IA 
Q 


Theorem 3.8 (De Morgan’s laws) Let C and D be classes (sets). Then 


a) (CUDY=C'nAD' 


=r (C'y =C (By definition of =) 
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b) (CN DY=C'UD' 


Proof: 


a) Given: C and D are classes (sets). 


E(CUDY = 


rg CuD 
=> xrgCandzgD 
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(By definition 3.4) 
(For if x € C or € D, then x € CUD) 


> xeC'AD 
> (CUD) co ny 


Next x € C'A D' 
(CUDY C OND 
C'AD' C (CUD) 


b) The proof is left as an exercise. 


=> 


t YUU Y 


x € C' and gz € D’ 

x¢Canda¢D_ (By definition 3.4) 

xE~CUD (Forifxe CuD, then x €C or xe D) 
e (CU DY (By definition 3.4) 

C'n D'C (CUDY 


(CUD) =C ny 


Theorem 3.9 Let C, D and E be classes (sets). Then 


a) CUD=DUC andCnD=DNC (Commutative laws) 


c) CU (DU E) = 
d) CU (DNA E) = 


) 

b) CUC=C andCnCc=C (Idempotent laws) 
) 
) 


(CUD)N 


(CUD)UE and CN (DN E)= 
(CUE) and CN (DUE) = 


(C N D) NE (Associative laws) 
(CAD)U 


Proof: The proofs of a) to d) are left as an exercise. 


Theorem 3.10 Let A be a class and Y denote the class of all elements. 


a) WUA=Y 
Lean A 
c) Y'= 
dg = 
e) AUA =% 


(C N E) (Distribution) 
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Proof: 


a) By theorem 3.5 a), V CU UA. Ifee UA, then ex € Aorz ec Y. In either case x 
is an element and so x € Y. Thus, YUAC WY. Hence, VUA= XY. 
Parts b) to e) are left as an exercise. 


3.5 Generalized distributive laws and De Morgan’s laws. 


The distributive law and the De Morgan’s laws generalize to arbitrarily large unions 
and intersections. 


Theorem 3.11 Let & be a non-empty class (set). 


a) (Ucew c)’ =Noew C” 
b) (Neew c)’ = Ucew Cc’ 


Proof: i 
a) re(U c) < x¢\(Jc 
CEA CEA 
= xgcC foral C eE x 
& xec” foral C eE g 
< se Ne’ 
CEA 


Part b) is left as an exercise. 


Theorem 3.12 Let D be a class and & be a non-empty class (set) of classes. 


a) DO (Ucew C) =Ucew(D NC) 
b) DU (New C) =Neew(D UC) 
Proof: 
a) 


repa( U c) & xreEDandzeEe U C 


CEA CEA 
= x€DandzeC for some C E€ & 


= x€ DNC for some C € & 


s xe [J (DNC) 
CEA 


Parts b) is left as an exercise. 
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Theorem 3.13 Let {Baj : i= 1,2,3,..., j =1,2,3,...} be a set of sets Then 


UZ (0321 Ba) = N0721 (UZ Bey) 


Proof: 


LE UP (9721 Busy) o> LE NF Bkz) For some k. 
=> LE Biki) For some k and all j. 
© “LE U1 Baz) For all j. 
S xE Nja UZ Bus) 


Concepts review: 


10. 


11. 


. If & is a class of classes how should we interpret the class Uceg C? 


. If æ is a class of classes how should we interpret the class (\o¢,y C? How do we know 


that this is indeed a class? 


. What does it mean to say that two classes A and B are disjoint? 
. What is the complement, C’, of a class C? 


. What is the difference, C — D, of the two classes C and D? What is the symmetric 


difference CAD? 


. If & is a non-empty set of sets how do we know that the union [Joey C is a set? 
. What do De Morgan’s laws say in reference to two classes C and D? 
. Let & be a class of classes. Can we generalize De Morgan’s laws to {C : C € #}? 


. Is it true that the union of sets C in a class & is a set? 


List the ZF-axioms that refer specifically to sets and were invoked at least once up 
to now? 


In algebra, we know about the distributive property of “multiplication over sums and 
differences”. Is there a similar property which refers to “unions distributing over 
intersections” and “intersections distributing over unions” ? 
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EXERCISES 


A. 1. Prove or disprove that if D € æ, then it is always true that D C Ucey C. 
. Show that if æ is a class of sets, then & C P(U. 7). 


N 


. Show that P(A) N P(B) = A(ANB). 
. Show that F(A) U P(B) = P(AU B). 
. Show that CN D = Ø if and only if A(C)N A(D) = {Ø}. 
. If A is a set show that UcewayC = A. 
. If A is a set show that NcewayC = Ø. 
. If A is a class show that A = U,. {2}. 
. Prove the following statements. 
a) Part b) of theorem 3.5. 
b) Part b) of theorem 3.6. 
c) Part b) of theorem 3.8. 
d) Part a) to d) of theorem 3.9. 
e) Part b) to e) of theorem 3.10. 
) 
) 


O ooN anA w 


f) Part b) of theorem 3.11. 
g) Part b) of theorem 3.12. 


We Daa aS a 


C. 10. If A and B are sets show that Y(A) € P(B) implies Y(A) C B and so A € B. 
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4 / Cartesian products. 


4.1 


Summary. In this section we define the notion of “ordered pairs” in terms of 
classes and sets. We then define the Cartesian product of two classes (sets). 
We also present a few of the basic properties of Cartesian products. 


Ordered pairs. 


The notion of “ordered pairs” is an important one since it is involved in most areas 
of mathematics. Most students are familiar with the idea of ordered pairs since they 
have learned early on that when given an ordered pair of numbers, the order in which 
the numbers appear conveys a particular meaning. 


For example, say 120 desks in an exam room are arranged in a rectangular grid of 10 
rows with each row containing 12 desks. Suppose each of the 120 students writing an 
exam in this room is given an ordered pair, (a,b), to be interpreted as follows: You 
will write your exam on a™ desk in the b** row where the first row is the one in the 
front of the room and the first desk in this row is the one which is closest to the door 
as you enter the room”. The student understands that the desk labeled (2, 3) is not 
the same desk as the one labeled (3, 2). The order in which the numbers are presented 
has meaning. 


We know that functions can also be represented by ordered pairs. For example, the 
function f(x) = z? with domain R can be represented by the set S = { (x, x”): x € R}. 
So we can say that the ordered pair (5, 25) is an element of this function while the 
pair (25, 5) is not, since the second entry is not the square of the first entry. 


We would now like to formally define “ordered pairs” in our set-theoretic axiomatic 
system. Our first step will be to remind ourselves of the way “ordered pairs” as we 
know them are defined. Someone may attempt to define an ordered pair as follows: 


Given two elements a and b, an ordered pair, (a,b), is a doubleton {a,b} 
where one element a is labeled as the “first” while the other element b is 
labeled as the “second”. The element labeled “first” must be listed first. 
The round brackets “( )” are used to indicate that (a,b) is not a simple 
doubleton but rather a doubleton where the order in which the elements a 
and b appear has a particular meaning. 


This is a bit wordy. Also, it is not clear what the words “first” and “second” mean. 
We have not defined these in our set-theoretic universe. Can we define ordered pairs 
without using the words “first” and “second”? That is, can we obtain an equivalent 
definition of “ordered pairs” by avoiding these two words entirely? Let us consider 
the following definition and then see if it works. 
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Definition 4.1 (Kuratowski definition) Given a pair of sets c and d, we can construct the 
class 


dic, te d}} 


The doubleton {{c}, {c, d}} is called an ordered pair. The sets c and d need not be distinct. 
Ordered pairs are denoted as (c, d) = {{c}, {c, d}}. 


First, we should verify that there are no inherent ambiguities in this definition. We 
verify immediately that if c and d are sets, then {{c}, {c, d}} is a set: 


cand dare sets = {c} and {d} are sets. (Axiom of pair) 
> {c} U {d} = {c, d} is a set. (Axiom of union) 
> {{c}, {c, d}} is a set. (Axiom of pair) 
Now that this has been established, we should make sure that the doubleton defined 


above satisfies the essential “ordered pairs property”, [(a,b) = (c,d)] = [a = c and 
b= d]. 


Theorem 4.2 Let a,b,c and d be sets. Then (a,b) = (c, d) if and only if a = c and b = d. 


Proof: 


oe That a = c and b = d implies (a,b) = {{a}, {a, b}} = {{c}, {c, d}} = (c, d) is imme- 
iate. 


(=) What we are given: (a,b) = (c,d). What we are required to show: a = c and b = d. 


(a,b)=(4d) > {ta}, ta ott = {{c} tad} 


Case l:a4b = {a,b} {c} hence {a,b} = {c, d} 
=> {a} ={e} 
> a=c 
{a,b} = {c,d} and a=c > bad 
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Case 2: a=b => {{a},{a, b}} = {{a}, {a, a}} = {{a}} 
= {a} = {c} and {a} = {c,d} 
> a=c 
{a} = {c,d} and a = c {a} = {a, d} 
=> {a,d} = {a,a} 
a=d 








4 


Then (a,b) = (c, d) if and only if a = c and b = d. 


From this theorem we deduce that: 
[a # b] = [(a, b) # (b, a)] 


Given the distinct sets a and b, we see that the sets {a, b} and (a, b) are indeed differ- 
ent mathematical “creatures”. The definition of ordered pair simply shows that (a, b) 
is constructed from a and b in a way that guarantees that the “ordered pair” property 
holds true. It says that (a,b) is a doubleton where one of its elements is a singleton 
while the other is itself a doubleton. Since the two elements of this set have different 
characteristics it allows us to decide which is the first entry and which is the second. 
We can say that “the singleton is the first entry”, while “the doubleton is the second 
entry”. 


Having defined an ordered pair of sets, we can conveniently define, in a similar way, 
an ordered triple (x,y,z) as an ordered pair where the “first” entry is itself an ordered 
pair: 

(x,y, z) = ((z,y), z) 


Alternate definitions of ordered pairs. There are other possible definitions of ordered 
pairs (c,d) in terms of sets. Many readers may find the following definition intu- 
itively preferable. This definition is a slight variation of the one put forward by Felix 
Hausdorff (1868 - 1942), so we will label it as being Hausdorff ’s. 


Definition 4.3 (Hausdorff) If c and d are sets, the expression (c, d) is defined as follows: 


(c, d) T { {c, Ø}, {d,{S}} 
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4.2 


The main reason why this definition may be intuitively appealing to some is that it 
looks more like we are indexing the two elements c and d with the symbols ¢ and {¢}. 
It allows one to visualize the ordered pair as follows: 


(c, d) = { {c, @},{d,{o}} } = {cø dio} = {co, di} 


This in fact resembles more the way we will be viewing ordered pairs once we define 
“functions” and the “natural numbers”. We will defer the proof which guarantees 
that this definition satisfies the essential property of ordered pairs to the end of this 
section. In this text, we will adopt the more commonly used Kuratowski definition. 


Cartesian products 


Now that we have defined ordered pairs of classes we can construct new classes with 
old ones. Recall how, from two known sets, say N and R we can construct a new set 
Nx R = {(n,2): n € N,x € R}. This is what we want to do with classes. Any 
two classes (sets) C and D can be used to construct another class (called a Cartesian 
product) whose elements are ordered pairs. 


Definition 4.4 Let C and D be two classes (sets). We define the Cartesian product, C x D, 
as follows: 


Cx D={(c,d):ce€ C andde D} 


We could of course also write C x D = { {{c}, {c,d}} :c € Candd € D}. Since 
we are particularly interested in constructing new sets from old ones, we should first 
make sure that as long as C and D are sets, then C x D is a set. We will do this by 
first proving the following lemma. 


Lemma 4.5 Let C and D be two classes (sets). Then the Cartesian product, C x D, of C 
and D satisfies the property Cx DC A(A(CUD)). 


Proof: 


What is given: That C and D are two classes (sets). 
What we are required to show: Cx DC A(A(CUD)). 


Let c € C and d € D. It will suffice to show that (c,d) € A(A(CUD)). 
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{c} € P({c, d}) and {c,d} € P({e, d}) {{c}, te, dt} © Alte, d}) 
(c,d) © P({c, d}) 
(c,d) € P(P({c, d})) 


(c,d) € P(A(CUD)) 


{yy 


P( Pfc, d})) © P(A(CUD))! 


Hence, C x DC P(P (CU D)), as required. 
İt To see this verify that S C T > P(S) C A(T) 


Theorem 4.6 If C and D are classes, then the Cartesian product, C x D, is a class. If C 
and D are sets, then C x D is a set. 


Proof: 


To show that C x D is a class we can express C x D as 


Cx D={xa:24€ A(A(CUD)) and x = (c,d) for some c € C and some d € D} 


and invoke axiom of construction A2 which declares it to be a class. 

Given that C and D are sets, then CUD is a set (by the Axiom of union A6) which implies 
that Y(A(CUD)) is a set (by the Axiom of power set A5). Since Cx DC A(A(CUD)), 
then C x D is a set (by the axiom of subset A4). 


We can then write that if C and D are sets, C x D is the set of all those specific el- 
ements u in Y(A(CUD)) which are of the form u = (c, d) for some c in C and d in D. 


Once we have defined the Cartesian product of two classes C and D, referring to our 
definition of ordered triples (c, d,e) = ((c,d), e), we can define the Cartesian product 
of three classes C, D and E as follows: 


CxDxE {(c,d,e): cE C,de€ Dee E} 
{((c,d),e): cE C,de D,ee E} 


= (Cx D)x# 


4.3 A few properties of Cartesian products 


The following theorem illustrates properties of Cartesian products involving the sym- 
bols intersections, “N”, and union, “U”. 
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Theorem 4.7 Let C, D, E and F be classes. Then 


a) Cx (DN E)=(Cx D)N(Cx E) 
b) Cx (DUE) =(C x D)U(C x E) 
c) (CAE)x D=(CxD)A(Ex D) 
d) (CU E) x D= (C x D)U (Ex D) 
e) (CUD) x (EUF) =(C x E)U(Dx E)U(C x F)U (Dx F) 
f) (CN D)x (ENP)=(Cx BE)N(Dx E)N(CxX FP)N(DxF 
Proof 
a) (c,d)€ Cx (DN E) c€Candde (DN E) 


c€Candde€ Danddc E 
(c,d)€ Cx D and (c,d)E CX E 


=> 
=> 
=> 
& (c,d) €(Cx D)N(Cx E) 


Hence, C x (DN E) =(C x D)N(C x E) (by axiom A1). 


Proofs of parts b) to f) are left as an exercise. 


Theorem 4.8 If CC D and EC F,thenCx ECDxF 
Proof: 
By definition C x E = {(c,e):c€ C and e € E} and 
Dx F={(d,f):de D and fE F}. 
(ceJECXE => ceEeCandee E 
= ce Dandee F (Since C C D and E C F) 
> (ce)eDxF 
Hence, Cx EC Dx F. 


The following theorem shows that there is a one-to-one correspondence between the 
elements of S x (U x V) and the elements of (S x U) x V. 


Theorem 4.9 Given three classes (sets) S, U and V there is a one-to-one correspondence 
between the two classes (sets) S x (U x V) and (S x U) x V. 
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Proof: 


Let 6: Sx (U x V) — (S x U) x V be defined as: ((s, (u, v))) = ((s,u), v). We will 
prove that ¢ maps distinct elements in S x (U x V) to distinct elements in (Sx U) x V. 
We can prove this by invoking theorem 4.2 as follows: 
(s, (u, v)) = (a, (b,c)) © s=a and (u,v) = (b, c) 

& s=aandu=bandv=c 

= (s,u) = (a,b) andv=c 
= ((s,u),v) = ((a, b), c) 
> 


(s, (u, v)) = o((a, (b, c)) 


4.4 Proof for the Hausdorff definition of ordered pairs 


We end this section with a proof showing that the alternate of 


(c, d) = { 16; Ø}, {d, {S}} } 


satisfies the essential property of “ordered pairs” and so can also be used to represent 
ordered pairs. 


Theorem 4.10 For classes c, d, eand f, if (c, d) = {{c, Ø}, {d, {@}}} and (e, f) = {{e, Ø}, {f, {a} }}, 
then (c,d) = (e, f) if and only if c = e and d = f. 


Proof: 
(<) That c = e and d = f implies (c, d) = (e, f) is immediate. 


(=) What we are given: 
` (c, d) = {{¢, Ø}, {d, {S}}} 
- (e, f) = tte, St, {Ff {o}}} 
- (c,d) = (e, f) 
What we are required to show: c = e and d = f. 


We first consider the case where c is the empty class. 


(Ø,d)= (e, f) = {{2, 2}, {4, {O}}} = (le, Oth {oth} 
= {{2},{d,{S}}} = {{e, 2}, {f {ohh} 
{{S}, {d, {2}}} = {{2, 2}, {f, LPH} (Since (7, {@}} can never equal (9}.) 
{{2}, {d, {S}}} = {12}, Cf, {abt 
{d,{2}} ={f, {2} 
d= f 


{yy 


A. 
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Thus, (Ø, d) = (e, f) > e = @ and d = f as required. We now consider the case where 


cø. 


(c d)= (e, f) > {{o g}, {d {2}} } = tte, 2}, {f {oh} } 


If {c, Ø} = {e, Ø} = c= eand it quickly follows that d = f. ( check the details.) 








If {c, O} # {6,8} > {c} = {f,{2}} and {d, {}} = {e, Ø} 
{c, Ø} = {f,{}} => f=Ø (Since g ¢ {Ø} this forces f = 2.) 
> {Øø}=c 
{d, {o}} = {e, oh => d= Ø and e = {Ø} ( For the same reasons as above.) 
c= {Ø} = e and d= Ø = f c=eandd= f. 


Note that the two different representations of ordered pairs (a,b), {{a}, {a, b}} and 
{{a, Ø}, {b, {@}}} do not form equal sets. These two classes only share the funda- 
mental property of ordered pairs. 


Concepts review: 


. What is the (Kuratowski) definition of the ordered pair (c, d)? 

. Given two classes C and D what is the definition of C x D? 

. If C and D are sets is it true that C x DC A(A(CUD))? Why? 

. Is it generally true that C x D = D x C? If so, why? If not give a counterexample. 


. Is it generally true that (C x D) U (E x F) = (CUE) x (DU F)? If so, why? If not 


give a counterexample. 


EXERCISES 


1. Prove that C x D = Ø if and only if C = Ø or D = Ø. 
2. Show that for classes C, D and E, (C x D)N(C' x E) = Ø. 
3. Show that ACBS>AxCCBxC. 
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. If (c,d) € C x D is it necessarily true that {c} € C and {c,d} € D? If so, why? If 


not, give a counterexample. 


. Suppose A, B and C are sets. Show that Ax CC Bx C = AC B, that is, the 


converse of the statement in question 3) holds true. 


6. Prove parts b) and c) of theorem 4.7 on page 39. 
7. Let S = {x} be a set. Show that (S x S)x S#Sx (SxS). 


11. 


. Describe each of the following classes. But first explain why each of these classes is 


a set. 

a) Ø x {Ø} 

b) {Ø} x Ø 

c) Ø x Ø 

d) {Ø} x {2} 
) 


e) {Ø x Ø} 


. Show that C x (D — E) = (C x D) —- (C x E). 
10. 


Is the statement “C x D = E x F if and only if C = E and D = F” always true? If 
there are situations where it fails to be true, state which ones. 


Show that if a and b are sets, then {{a, Ø}, {b,{}}} is a set. 
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5 / Relations on a class or set. 


Summary. In this section we define a relation R on a class (a set) S. For 
a relation R on a set S we define the inverse R7! of the relation R. We also 
define the domain and the image of R. The composition of two relations R and 
T is defined and some of their properties are given. 


5.1 Relations on a class or set. 


Recall that the symbol Y denotes the “Universal class”, {x : « = x}. Since Y isa 
class, we can then construct the Cartesian product, Y x Y, itself a class (as we have 
seen). Recall that the elements of Y x Y are ordered pairs. 


Definition 5.1 a) We will call any subset R of ordered pairs in Y x Y a binary relation.+ 


b) We will say that R is a binary relation on a class C if R is a subclass (subset) of 
C x C. In such cases we will simply say that R is a relation in C (or on C). 


c) If A and B are classes (sets) and R is a subclass (subset) of A x B then R can be 
viewed as a relation on AUB. 


From this definition we see that any Cartesian product C x D is a relation. But a 
relation need not be a Cartesian product. For example, the smallest Cartesian product 
which contains the set G = {(a,c), (b,d)} is A x B where A = {a,b} and B = {c,d}. 
But G # A x B since (a,d) € (A x B) - G. 


We will be referring to specific kinds of relations on a class or set. Given a class C, a 
relation on Č is usually expressed by the symbol, R, although other capital letters are 
used whenever it is necessary to distinguish between two relations on a same class. 
Suppose R is a relation on a class C and that (x,y) € R. Then, by definition, both x 
and y belong to C. Common ways of expressing that (x,y) belong to the relation R 
are: 


j- (x, y) ER 
- xRy holds true 


- x is related to y under R. 


'tThe word binary refers to fact that the elements of R are doubletons (pairs). We can also speak of 
a ternary relation when considering subclasses of the Cartesian product Y x Y x Y. Unless we specify 
otherwise, all relations in this text are assumed to be “binary” and so the word relation will be used to 
abbreviate the words binary relation. 
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Remark: Recall from lemma 4.5 that C x DC Y(A(CUD); hence, if R is a relation 
subset of C x D then RC A(A(CUD). 


5.2 Examples of relations on a class. 
We provide a few examples of relations in Y. 


1) We will define a relation, R1, in Y as follows: (x,y) € Ry if and only if x € y. 


This says that “any x is related precisely to those classes (sets) y which contain 
it.”. We can also write 


Ri ={(z,y): 2 € y} 


For example, we can write (a, {a, b}) € Rı or, if one prefers, aR; {a, b} “holds true”. 
On the other hand, we can write (b, {c,d}) ¢ Ry. Also, (Ø, Ø) ¢ Rj. 

2) We define a relation, R2, in YW as follows: (x,y) € Rə if and only if £x C y where 
“r C y” means “x CyandaAzy”. 
This says that a class (a set) x is related precisely to those non-empty classes (sets) 
y # x which contain the elements of x. Since x = x, then x É x; hence, (x, x) ¢ Ro. 
Assuming that a 4 {b}, we see that (a, {a,b}) Z Ro, but that ({a}, {a,b}) € Ro. 
The statement «R2@ is false for all classes x including the class Ø. 

3) We define a relation Rg in Y as follows: (x,y) € Rg if and only if x = y. 
This says that a class (a set) x is related only to itself and no other class. We can 
write Rg = {(x,y) : x = y}. We see that (a, {a}) ¢ Rg but that ({a}, {a}) € R3. 
The statement @R3@ is true. 

4) We define a relation R4 in X as follows: Ry = {(x,y): aC y}. 
This means two elements x and y are related only if x is contained in y. Notice 
that if x Æ y and (x,y) € R4, then it is impossible for (y, x) to belong to Ry. The 
ordered pair, ({@},{2,{@}}), belongs to R4 but ({9,{@}},{@}) does not. We 
also see that the pair (x, {x}) ¢ R4. 


Definition 5.2 Let C be a class (a set). 
a) The relation 
Ec= {(x,y) : (x,y) € C x Cx € y} 


is called the membership relation on C. 
b) The relation 
Ido = { (x,y) : (x,y) €E C x C, x= y} 


is called the identity relation on C. 


Part III: Relations 47 


5.3 


We see that the only elements in the identity relation Idg on Y are those of the form 
(xr): 


The domain and the image of a relation. 


The reader may see some similarities between the concept of a relation on C and what 
is known to be a “function from a set C to C”: 


— Both relations and functions are collections of ordered pairs. 


— In both cases, the first entry should not be confused with the second entry. The 
second entry is often defined in terms of the first entry: That is, a rule states why 
the second entry is related to the first. This rule is the mechanism which allows us 
to determine which ordered pairs belong to the relation or the function and which 
don’t. 


Definition 5.3 Let R be a relation on a class (set) C. The domain of R is the class, 
dom R = {x : x € Cand (x,y) € R for some y € C}. The image of R is the class, 
imR = {y : y € Cand (2,y) € R for some x € C}. The word range of R is often used 
instead of “the image of R”. If R C Ax B is viewed as a relation on AUB, then dom R C A 
and im R C B. 


5.4 


Example: Suppose R is the membership relation on the set S where S is defined as, 


S ={ a,b, c, {a}, {a,b}, {{c}}, 2, {Sh} 
That is, R = { (x,y): x € y}. Find dom R and im R. 


Solution: 


To find the domain and the image of R we will write out the elements of R explicitly: 
R= { (a, {a}), (a, {a,b}), (0, {a,b}), (Ø, {18}) } 


The domain, dom R, is 
dom R = {a, b, Ø} 


while the image, im R, is 


im R = {{a}, {a,b}, {Ø}} 


The inverse of a relation R on a set S. 


Just as for one-to-one functions, we can speak of the inverse of a relation R on a set 
S. However, a relation need not be “one-to-one” to have an inverse. “One-to-many” 
relations are quite common. We will begin by formally defining what we mean by the 
inverse of a relation. 
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Definition 5.4 Let C be a class (a set) and let R be a relation defined in C. The inverse, 
Rt, of the relation R is defined as follows: 


R™ = {(x,y):(y,2) € R} 


In the example above we defined a relation on the set 
S= { a, b,c, {a}, {a, b}, Cane ©, {S}} 
as R= {(x,y):x E€ y}. 


Since R was found to be: 


R= į (a, {a}), (a, {a,b}), (b, {a,b}), (8, 19}) } 


then the inverse relation, R7!, is 


R™ ={ ({a},a), ({a,b},a), ({a, b}, b), 3}, 2) } 


The inverse of R can also be expressed in the more succinct form 
R = { (x,y): y € £}. 


5.5 The composition of two relations R and T. 


Just like pairs of functions f and g, a pair of relations R and T on a class C can 
be combined to obtain a new relation. Other than the fact that the first entries of a 
relation can be associated to many values in the image of R, compositions of relations 
work exactly like the composition of functions. We define the composition of two 
relations as follows. 


Definition 5.5 Let C be a class (a set) and let R and T be two relations in C. We define 
the relation T° R as follows: 


T°- R = { (x,y) : (z, y) € T where (x,z) € R} 


Suppose, for example that the relations R and T on the set 


S= { a,b,c, {a}, {a, b}, {ich}, ©, {ø} 
are defined as: 
R = {(z,y): vey} 
T = {(z,y):x= {y}} 
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Then the relation R can be described as: 


R= į (a, {a}), (a, {a, b}), (b, {a,b}), (9, 19}) } 


The relation T is: 
T= { ({a}, a}), ({S}, S})} 


For the relation T° R we obtain: 


T° R = {(a,a), (Ø, Ø)} 


For the relation ReT we obtain: 


RT = {({a}, tat), {St {2})} 


Concepts review: 


1. Given a class C, what is a relation on C? 

2. Given a relation R on a class C, what does the expression xRy mean? 
3. Given a class C what is the membership relation, Ec, on C? 

4. Given a class C what is the identity relation, Idc, on C? 


5. Given a relation R on a class C what is the domain, dom R, of R and the image, im 
R, of R? 


6. Given a relation R on a class C what is the inverse, R7}, of the relation R? Is R! a 
relation on C? 


7. Does a relation R on C have to be “one-to-one” for R~! to be a relation? 


8. If R and T are two relations on a class C, what does ReT’ mean? 


EXERCISES 


1. Suppose R, S and T are three relations on a set A. Prove that T° (R-S) = (T° R)’ S. 
2. Suppose R is a relation on a set A. Prove that (R~')~! = R. 
3. Let R = {(a, a), (a,c), (c,c), (c,d)} and T = { (a,b), (c, a), (d,c)}. Describe: 

a) R7! andre! 

b) ReT and T°R. 

c) ReT7! 
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4. If Ris a relation on a set S show that dom R7! = im R. 


5. Let C= {9,{S}} and D = AC). 
a) Write out explicitly the elements of the membership relation, €p, on D. 
b) Write out explicitly the elements of the identity relation, Idp, on D. 
c) Describe the dom €p and im€p. 
d) List all elements of D = YA(C). 
) 


e) List all possible relations on C. 


6. Let R, S and T be three relations on a set A. 
a) Prove that (RU S)eT = (ReT) U (S°T). 
b) Prove that (RA S) HR ns 
c) Prove that R C T implies R7} C T7!. 
d) Prove that dom (S U T) = dom § U domT. 
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6 / Equivalence relations and order relations. 


6.1 


Summary. In this section we define special types of relations on a class or 
set: reflexive, symmetric, antisymmetric, asymmetric and transitive relations. 
Equivalence relations on a class S will be defined as those relations which are si- 
multaneously reflexive, symmetric and transitive. We also define “partial order 
relations” and “strict order relations” on sets and provide examples of these. 


A few special types of relations on a class S. 


Although we would not normally think of it as a relation, the empty set is a relation 
on any non-empty set S: 


B=OGxS={(z,y):ceEeSyeS}osxs 
Its main properties are: 
i) dom Ø = {x € S : (x,y) E Ø} = Ø, 
ii) O* = {(x,y) : (y, £) € Ø} = Ø, 
iii) im Ø = {y E S : (x,y) E€ Ø} = Ø. 


We define other special types of relations below. 


Definition 6.1 Let S be a class and R be a relation on S. 


a) 
b) 


We say that R is a reflexive relation on S if, for every x € S, (x,x) € R. 
We say that R is a symmetric relation on S if, whenever (x,y) € R, then (y, £) € R. 


We say that R is an antisymmetric relation on S if, whenever (x, y) € Rand (y, x) € R, 
then x = y. 


We say that R is an asymmetric relation on S if, whenever (x, y) € R, then (y, x) £ R. 


We say that R is a transitive relation on S if, whenever (x,y) € R and (y,z) € R, 
then (x, z) E€ R. 


We say that R is an irreflezive relation on S if, for every x € S, (a,x) £ R. 


If, for every x,y E€ S where x Æ y, either (x,y) E€ R or (y,x) € R, then we say “any 
two elements a and b in S are comparable under the relation R”. 


It follows from these definitions that a relation which is both antisymmetric and 
irreflexive must be asymmetric. We illustrate these relation properties with the fol- 
lowing four examples. 
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1) Let G denote the set of all individuals in Gotham City. Consider two relations on 
G: 
R {(f,b) : f is a female and b is a brother of f} 


T = {(a,y): x and y are distinct siblings.} 


* R is irreflexive since a female inhabitant of this city cannot be her own brother. 
The relation T is also irreflexive since a person cannot be two children of the 
same biological parents. 


* R is an asymmetric relation on G since if (f,b) € R, then (b, f) ¢ R since b 
is male. However, x and y are distinct siblings in whichever order we consider 
them. So T is symmetric. 


* Transitivity: Since both (a,b) and (b,c) cannot belong to R (since b is male), 
then we will say that R is “vacuously” transitive. The relation T is transitive 
since if x and y are siblings and y and z are siblings, then x and z are siblings. 


* If we assume that Gotham City contains more than one family, then there are 
pairs of individuals which are not comparable under both R an T. 


2) Let S = {a,b,c,d}. Consider the relation Rı = {(a, a), (b,b), (c, c), (d, d), (a, b)} on 
S. 


* R, is a reflexive relation on S since Rı contains (x, x) for each x € S. 

* R; is not a symmetric relation on S since R; contains (a,b) but not (b, a). 

* R; is “vacuously” antisymmetric on S since Rı contains (a,b) and that (b, a) is 
not in R,.! 

* Ry, is not asymmetric since Rı contains (a, a). 

* R, is transitive on S since Rı contains (a,b) and (b,b) and also contains (a, b). 


3) Let S = {a,b,c,d}. Consider the relation Rə = {(a, a), (b,b), (d, d), (a, b)}. 


* Rə is not a reflexive relation on S since Rg does not contain (c,c). It is not 
irreflexive since it contains (a, a). 

* Ro is not a symmetric relation on S since Rg contains (a,b) but not (b, a). 

* Rg is vacuously antisymmetric since Ry contains (a,b) and (b, a) is not in R?. 

* Ro is not asymmetric since Rə contains (a, a). 

* Rə is transitive since Rə contains (a,b) and (b,b) and also contains (a, b). 


4) Let S = {a,b,c,d}. Consider the relation 


Rs = {(a, a), (b, b), (c, c), (d, d), (a, b), (b, a), (b, c), (c, b)} 


* We see that Rs is a reflexive relation on S. 


‘Note that the statement “whenever (a,b) and (b,a) are in S, then a = b” holds true. 
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* Since Rg contains both pairs { (a,b), (b,a)} and {(b, c), (c,b)}, then Rg is a sym- 
metric relation on S. 

* Since R contains {(a, b), (b,a), (a,a)} and {(b, c), (c,b), (b,b)}, then Rg is anti- 
symmetric. 

* Since Rg contains (a,a), then R3 is not asymmetric. 

* Since R3 contains the triples {(a, b), (b,a), (a,a)} and {(b,c), (c, b), (b,b)}, then 
Rg is transitive on S. 


A word of caution: Some readers may conclude that a relation R which is both 
symmetric and transitive on a class S' is automatically reflexive based on the following 
reasoning: 


Symmetric R says “(a,b) € R implies (b,a) € R” while transitive R says 
“(a,b) and (b,a) in R implies (a,a) E€ R”. So “symmetric + transitive > 
reflexive” . 


This conclusion is however not correct. 

Consider the relation R = {(a, a), (b, b), (d, d), (a,b), (b,a)} on the set S = {a, b,c, d}. 
It is symmetric and transitive and yet (c,c) is not in R and so R is not reflexive. We 
should remember that if a relation R is to be reflexive on S we must have (x, x) € R 
forallxe S 


6.2 Equivalence relations on a class S. 


Equivalence relations are important types of relations on classes and sets. Students 
who study other fields of mathematics will frequently encounter sets equipped with 
this type of relation. 


Definition 6.2 Let S be aclass and R be a relation on S. We say that R is an equivalence 
relation on S if R is simultaneously reflexive, symmetric and transitive on S. 


Examples: Let S and T be two non-empty classes. 
a) Recall that Ids denotes the identity relation on S: 
(x,y) € Ids if and only if x = y 


In this relation an element is only related to itself and no other. It is easily seen 
that Ids is reflexive, symmetric and transitive on S. Thus, the identity relation, 
Ids, is an equivalence relation on S. 
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b) Let D = S x T. We define a relation, R, on D as follows: 


((a, b), (c,d)) € R if and only if a = c 


This means all ordered pairs in S x T with the same first entry are related under 
R. Since 
i. ((a,b), (a, b)) € R) for all (a,b) € D so R is reflexive on D. 
ii. ((a, 6), (c,d)) € R = a = c > ((c, d), (a,b)) E€ R so R is symmetric. 
iii. ((a, b), (c,d)) € R and ((c, d), (e, f)) € R= a= c= e > ((a,b), (e, f)) E R 
so R is transitive. 


We conclude that R is an equivalence relation on D. 


c) We will refer to the first example presented on page 52. If G denotes all the 


inhabitants of Gotham City and H is the relation on G defined as 
H = {(a,y): x and y are siblings or the same person} 


then H is reflexive, symmetric and transitive and so forms an equivalence relation 


on G. 


6.3 Order relations on a class S. 


We now discuss another very important type of relation called “order relation”. An 
order relation will be either strict or non-strict. In each of those two categories an 
order relation can be a partial ordering or a linear ordering. These terms are defined 
below. 


Definition 6.3 Let S be a class. 


a) Non-strict order relation. The relation R is a non-strict order relation on S if it is 
simultaneously reflexive (aRa holds true for any a in S), antisymmetric (if aRb and 
bRa then a = b) and transitive (aRb and bRc implies aRc) on S. A non-strict order 
relation, R, on S is said to be a non-strict linear order relation if, for every pair of 
elements a and b in S, either (a,b) € R, (b,a) E R ora = b. That is, every pair of 
elements are comparable under R.' A non-strict ordering, R, on S which is not linear 
is said to be a non-strict partial ordering relation on S.? 


1A class on which is defined a linear ordering R is also said to be fully ordered or totally ordered by R. 


In certain branches of mathematics, “linearly ordered set” is abbreviated as l.o.set or simply called loset. 


? Again in certain branches of mathematics, “partially ordered set” is abbreviated as p.o.set or simply 


called a poset 
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b) Strict order relation. The relation R is a strict order relation if it is simultaneously 
irreflexive ((a,a) ¢ R), asymmetric ((a,b) € R = (b,a) ¢ R) and transitive on S. 
If every pair of distinct elements, a and b, in S' are comparable under a strict order 
relation, R, then R is a strict linear ordering on S. Those strict orderings which 
are not linear are called strict non-linear orderings or, more commonly, strict partial 
ordering relation. 
A non-strict partial order R on S always induces a strict partial order R* by defining 
aR*b = [aRb and a Æ bj. Similarly, a strict partial order R on S always induces a 
non-strict partial order R' by defining aRtb > [aRb or a = b]. 


Note that none of the relations defined above are equivalence relations since a partial 
ordering relation is normally not symmetric, while the strict ordering relation is not 
reflexive. 


At first glance, the reader may find it difficult to distinguish one relation from the 
other or remember precisely what they mean. Studying the following few examples 
carefully will help construct a mental representation of the structure these relations 
provide to sets. 


Example 1. 


Mortimer is constructing a chart in which he will list all of his ancestors. The set of 
all his ancestors is represented by S. He defines an order relation R on S as follows: 
If a and b are two ancestors, (a,b) € R only if b is an ancestor of a — equivalently, a 
is a descendant of b. We list some properties of the relation R: 


— We see that R is transitive since, “a is a descendant of b” and “b is a descendant 
of c” implies “a is a descendant of œ”. 


— Since an element a cannot be an ancestor of a, R is irreflexive. 


— Finally, if a is a descendant of b, then clearly b cannot be a descendant of a. So 
R is asymmetric. 


We conclude that R is a strict order relation on S. Instead of writing (a,b) € R, we 
will write a < b with the understanding that “<” is only to be interpreted as “a is a 
descendant of b”. We list a few more properties of R: 


— It is clear that R does not linearly order S since one parent of Mortimer cannot 
be an ancestor of the other parent (excluding cases where something highly 
unnatural is going on). Hence, there exists pairs of ancestors a and b such 
that a £ b and b £ a. 


— Let’s assume that Mortimer has included himself in the set S and is represented 
by the letter M. Then M < a for all a € S. We will say that M is the minimum 
element of S with respect to the ordering “<”. 
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— Beginning with M, Mortimer can trace different paths upwards forming chains 
of inequalities each in the form M <a<b<c<.---<.---. Such chains are 
linearly ordered subsets of S since for any two elements a, b in such chains either 
a<borb<a. So, not only is M the minimum element of S, M is also the 
minimal element of each chain. 


To allow us to illustrate in this example as many properties of ordered relations as 
possible, let’s assume that Adam and Eve were “spontaneously generated” and so 
were the most ancient of Mortimer’s ancestors (assuming Adam and Eve are the only 
human beings which were spontaneously generated). Say that in the set S, Adam is 
represented by A and Eve by E. We add another few properties of the set S when 
equipped with the given order relation R: 


— We see that there are numerous chains of elements (linearly ordered subsets of 
S) each of which begins with the minimal element M and finishes with either A 
or E. 


— If a chain C linearly links M to A, then A is a maximal element of this chain 
in the sense that all elements of S which are comparable to A are “below” A. 
Similarly, E is the maximal element of all chains which link M to E. 


— However S has no maximum element since A is not an ancestor of E and E is not 
an ancestor of A. The elements A and E are simply not comparable under R. 
In this sense, we can say that S has no maximum element and only two maximal 
elements. 


We now formalize some of the concepts illustrated in this example with the following 
definitions. 


Definition 6.4 Let S be a class and R be an order relation on S. If R is a non-strict 
order relation, “(a,b) € R” is represented as “a < b”, and if R is a strict order relation, 
“(a,b) E€ R” is represented as “a < b”. If a < b and a Æ b, we will simply write a < b. 


a) 


b) 


A subset of S' which is linearly ordered by R is called a chain in S. If R linearly orders 
S, then S is a linearly ordered subset of itself and therefore is a chain. 


An element, M, of S is called a maximal element of S with respect to < if there does 
not exist an element b in S such that M < b. An element m of S is called a minimal 
element of S with respect to < if there does not exist an element b in S such that 
b<m. 


Let M and m be elements which are comparable (with respect to <) to all elements 
of the set S. The element, M, in S is called the maximum element of S with respect 
to < if there does not exist an element a in S such that M < a. The element m in 
S is called the minimum element of S if there does not exist an element a in S such 
that a <m. 
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Note that a mazimum or minimum element of an ordered set S must be comparable 
to all elements of S. This is not necessarily the case for a maximal or minimal ele- 
ment of S. Referring to the example on “ancestors” of Mortimer above, S has two 
maximal elements A and E; it has no maximum element, since no single element in 
S is strictly larger than all other elements. But not much is needed to give to S$ 
a maximum element: Suppose that only Adam was “spontaneously generated” and 
Eve was somehow formed from one of Adams ribs; then Adam would be Eve’s only 
ancestor and so A would be the maximum element of S. 


Example 2. 


Let S denote the set of all molecules constructed from the atoms listed in the periodic 
table of elements. In this case, molecules are viewed as sets whose elements are atoms. 
(We exclude crystals.) The simplest molecules are those that contain only one atom. 
We define the relation R on S as follows: (a,b) € R if a Æ b and all atoms in molecule 
a are contained in molecule 6 and any atom which appears n times in a also appears 
n times in b. If (a,b) € R, we will write a C b (or say that a is a proper subset of b). 
We describe the structure of S when equipped with this particular relation R. 


— By definition, R is irreflexive. 


— If molecule a is a proper subset of molecule b, then 6 cannot be a proper subset 
of molecule a and so R is asymmetric. 


— Ifa cband bcc, then a C cand so R is transitive. 


We conclude that the relation R strictly orders the set S. The set S is however not 
linearly ordered since well-known molecules such as Clg (gaseous chlorine) and Hə 
(hydrogen gas) are not comparable under “Cc”. We discuss a few more properties of 
S when equipped with R. 

— Since a molecule made of a single atom cannot be properly contained in any other 
molecule, “single-atom molecules” are minimal elements of S. So S has as many 
minimal elements as there are atoms. Clearly, S does not contain a minimum 
element. 


— There are atoms belonging to the family of noble gases (helium (He), argon (Ar), 
krypton (Kr), etc.) that are non-reactive and so do not tend to bond with other 
elements to form molecules. In S, these elements will form one element chains. 
For example, the helium atom is not properly contained in any other molecule. 
It is both a maximal and minimal element of S. These particular atoms (noble 
gases) form in S what is called an antichain. An antichain is a subset of an 
ordered set in which no two elements are comparable. 

— Other molecules will join together to form new molecules. For example the 
carbon element, C, and hydrogen element, H, both belong to the molecule, CH2, 
which will join with some other molecules containing carbon and oxygen atoms 
to form C2H204. 
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— The order structure of S will then contain numerous chains of molecules. One 
suspects that at some point each of these chains may attain some extremely large 
molecule which is non-reactive or will be too unstable to form other lasting links. 
If this is the case, then such a molecule is a maximal element of S. However, S 
cannot have a maximum element since elements such as those belonging to the 
noble gases are not contained in any molecule. 


Example 3. 


Consider the set Z of all integers. If m and n are non-zero integers, we will say that 
“m divides n” if there exists a positive integer k such that mk = n. We will define 
the relation R on Z as follows: 


R= {(0,n):n€ Z}U {(m,n) : m divides n} U {(m, n): m <0 and n > 0} 


We first determine the most elementary properties of this relation. 


— For every non-zero integer m, m x 1 = m so (m,m) € R. Also (0,0) € R. Then 
R is a reflexive relation on Z. 


— Let m, n and k be non-zero integers in Z. If (0,m) € R and (m,n) € R, then 
(0,n) € R. By considering each of the possible positive-negative cases for the 
values of m, n and k we see that if (m,n) € R and (n, k) € R, then (m,k) € R. 
Then R is a transitive relation on Z. 


— If mand n are both negative or both positive, then if m divides n and n divides 
m, then it must be the case that m =n. If m is negative and n is positive, then 
(m,n) € R but (n,m) € R. Finally (0,m) € R for all m, but (m,0) € R only if 
m = 0. We conclude that R is antisymmetric. 


We conclude that the relation R partially orders the set Z. The expression “a < b” 
will be another way of stating that (a,b) € R. We will writea < b if a < b and a £ b. 
We see that R does not linearly order Z since neither “3 divides 5” nor ”5 divides 3“ 
holds true. So 3 and 5 are not comparable under R. We list a few more properties of 
the order structure imposed on Z by R. 


— It is explicitly stated in the definition of the relation R that 0 is the minimum 
element of Z. Given any positive integer n, n + 1 does not divide n and so n 
cannot be a maximum element of Z. So no integer is a maximum element of Z 
with respect to the relation R. 


— We will say that a chain C is a mazimal chain if no larger chain in Z properly 
contains C. If p represents a positive prime number, maximal chains must begin 
with 0 < —1 < —p, when listed in the order dictated by R. For example, 





0 < —1 < —3 < —6 < —12 < —24 <... <1 <7 <14<28<56<.-.-. 
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is an example of a chain with respect to the order relation R. But many distinct 
chains may begin with 0 < —1 < —3. For example, 


0< —1 < —3 < —9 < —27 < -54 < -<1 <2 <4<12<24<... 





In the order structure defined by R all maximal chains contain the number one. 


Example 4. 


Let C denote the class 
{S : S is a set and S € S} 


It seems that up to now, we have avoided discussing sets S which satisfy the property 
“S € S”; nor have we proved that such sets don’t exist. We could put forward a 
set theory axiom which excludes such sets from the universe of sets, but a this point 
there seems to be no strong reason to deny the existence of such sets.! For now, we 
will assume that the class C is non-empty. Suppose T is a set, ordered by €, which 
contains an element a such that a € a. Let U = {a} be a subset of T linearly ordered 
by €. Suppose m is a minimum element of U. Then m = a; but we know that 
a E m =a. This contradicts our definition of “c-minimum”. So U cannot contain a 
minimum element with respect to €. The linearly €-ordered set U = {a} may appear 
a bit strange to many readers who wonder “How can a set which contains a single 
element not have a <-minimum element?” . For now, we will let such sets be. If these 
become a source of important contradictions we will then see if a special axiom is 
required. 


Concepts review: 


6. 


7. 


. What is the “empty relation” on a class S? 

. What does it mean to say that the relation R on S is reflexive? 

. What does it mean to say that the relation R on S is irreflexive? 
. What does it mean to say that the relation R on S is symmetric? 


. What does it mean to say that the relation R on S is asymmetric? 


What does it mean to say that the relation R on S is antisymmetric? 


What does it mean to say that the relation R on S is transitive? 


1The Axiom of regularity will have something to say about this. 
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8. What does it mean to say that any two elements a and b in S are comparable under 
the relation R? 

9. What is an equivalence relation R on a class S? 

10. What is a partially ordered class? 

11. Give an example of a partial ordering on A(S). 

12. What is a partially ordered set? 

13. What is a strictly ordered class? 

14. Give an example of a strict ordering on A(S). 

15. What is a poset? 


16. If R is a partial order on the set S, what does it mean to say that a is a maximal 
element of S? What does it mean to say that a is a minimal element of S? 


17. What is a chain in a partially ordered set? 
18. What is the maximum element of a partially ordered set? 


19. What is a linearly ordered set? 


EXERCISES 


. Suppose R is a reflexive relation on a class S. Show that Ids C R. 
. Show that if R is reflexive, then R~! is reflexive. 
. Show that if R is symmetric, then R7! is symmetric. 


Aa V Ne 


. Show that if R is transitive, then Rt is transitive. 


. Show that if R is asymmetric, then RN R7! = Ø. 
. Show that if R is an equivalence relation, then R° R = R. 


QO ol 


7. Suppose T is a reflexive relation on a class S. Show that for every relation R on S, 
RCTeRand RC RT. 


8. Show that if R is a partial order relation on S, then RN Rt = Ids and R°R = R. 
9. Show that if R is a partial order relation on S, then so is R71. 


10. Suppose R is an equivalence relation on a class S. Show that if H and J are relations 
on S, then RC (HOJ) >= RC H°J. 
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11. Show that if a is a maximum element of a partially ordered set S, then it is the only 
maximal element of S. 

12. Show that if a is a maximal element of a linearly ordered set S, then it is a maximum 
element of S. 
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7 / Partitions induced by equivalence relations. 


7.1 


Summary. In this section we show that an equivalence relation R on a set 
S can be used to subdivide S into pairwise disjoint subsets. The subsets when 
viewed together are called a partition of S. We illustrate how such partitions are 
obtained. 


Subdividing a set S using an equivalence relation R on S. 


Suppose R is an equivalence relation on the set S. We will show that this equivalence 
relation R, no matter how it is defined, can be used to subdivide the set S' into a 
collection of non-empty pairwise disjoint! subsets. Furthermore, no element of S will 
be left out in this process (in the sense that every element of S will belong to one of 
these subsets). 


The process is similar to subdividing a school’s student population, S, into smaller 
subgroups {.$1, S2, S3,...} based on some predefined student characteristics; the char- 
acteristics are such that no student can belong to two subgroups. 

Note: Although most of the results proven from here on can apply to classes we will 
prove these as applied to sets. 


Notation 7.1 Let R be an equivalence relation on a set S and let x € S. We define the 
set S, as follows: 


Sx = {y : (x,y) € R} 


That is, Sy is the set? of all elements y in S such that y is related to x under R. 


The following five theorem statements show, step by step, how an equivalence relation, 
R, on S partitions the set S into subsets, {Sy : x € S}. The reader should notice how 
all three relation properties which characterize equivalence relations are required to 
partition the set S$ in this way. 


Theorem 7.2 Let R be an equivalence relation on a set S. Let x and y be two elements 
in S which are not related under R. Then any element z in S$ which is related to x cannot 
be related to y. 


1A set .Y of sets whose elements are “pairwise disjoint” means that for any two sets A and B in J, 
ANB=2. 

?We justify that Ss a “set” as follows: We have that S+ is a subclass of the class S; given that S is 
declared to be a set, by axiom A4 (axiom of subset) Sz is a set. 
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Proof: 

What we are given: 

- R is an equivalence relation 

£, Y, ZES 

- (x,y) Z R but (z,z)eR 
What we are required to show: That (y, z) € R. 
Suppose (y,z) € R. Then (z,y) € R (since R is an equivalence relation and so is 
symmetric). But (x,z) € R and (z,y) € R= (x,y) € R (since R is an equivalence 
relation and so is transitive). This contradicts our hypothesis: (x, y) ¢ R Thus, (y, z) ¢ 
R, as required. 


Theorem 7.3 Let R be an equivalence relation on a set S. Let x and y be two elements 
in S which are not related under R. Then Sz N Sy = Ø. 


Proof: 
What we are given: 
- Ris an equivalence relation 
ayes 
(x, y)ER 
What we are required to show: Sz N Sy = Ø. 


Suppose z € Sz N Sy. Then z belongs to both S, and Sy and so is related both to x and 
to y. Since R is an equivalence relation on S, it is transitive and so x must be related to 
y, a contradiction. So z cannot belong to Sz N Sy. So Sz Sy = Ø as required. 


Theorem 7.4 Let R be an equivalence relation on a set S. Let x and y be two elements 
in S which are related under R. Then S; = Sy. 


Proof: 
What we are given: 
- Ris an equivalence relation 
ayes 
- (x,y) ER 
What we are required to show: S; = Sy. 
We claim that Sy C Sy. Let z E€ Sp. Then (x, z) = (z,x) € R (since R is symmetric). 
Since x is related to y, then [(z,x) € R and (z,y) € R]= [(z,y) € R] (since R is 
transitive). So z € Sy. Thus, Sy C Sy as claimed. 
We prove in a similar way that Sy C Sz. Thus, S; = Sy as required. 
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Theorem 7.5 Let R be an equivalence relation on a set S. For every x € S, there exists 
some y € S such that x € Sy. 


Proof: 
What we are given: That R be an equivalence relation and x € S. 
What we are required to show: That there exists some y € S such that x € Sy. 


Since R is reflexive, (x,2) € R. Thus, x € Sp. So it suffices to choose y = x. Hence, 
£ E Sz. 


Theorem 7.6 Let R be an equivalence relation on a set S. Then Upes Sz = S. 


res 
Proof: 


What we are given: That R is an equivalence relation on S. 
What we are required to show: That Upes Sz = S. 


Let x € S. Then x € Sz. Hence, x E€ Upes Sx. Thus, S C Upes- Let y E€ Ures Sz. Then 
there exists some x € S such that y € S,. Since Sy C S for all x € S, then y € S. Thus, 
Uzes Sz E S. We conclude that U eg Sz = S as required. 


7.2 A partition of a set induced by an equivalence relation. 


Given an equivalence relation R on a set S, we have seen how S can be expressed as 
a union, |J „eg Sx, of pairwise disjoint subsets, Sy, of S. Let 


Fr ={Sz: x E S} 


denote the set of all the sets formed from the equivalence relation R. We verify that 
the class g is indeed a set: Since S is declared to be a set, then Z (S) is a set (by 
the Axiom of power set); since Zp is a subclass of the set Z (S), then, by the Axiom 
of subset, Zp is a set. 


We examine the elements of Zr more carefully to see how they relate to each other. 
— Having shown that [(a,y) € R] > [Sz = Sy], many of the sets in Zp are the 
same set. That is, identical sets may simply have different labels. 


— If (x,y) € R, then Sz N Sy = Ø. So any two sets Sy and Sy in Fp are either the 
same set or S; N Sy = Ø. No two subsets in Spr can overlap. 


— Since R is reflexive, then no subset Sy is empty. (For any x € S, x E€ Sx.) 


— On the other hand, if x € S, since R is reflexive, x € Sy and so x € Upes Sz- 
Thus, S C Ures Sz- 


Part III: Relations 65 


— We have already seen that (pes Sz = S. 


We summarize three important properties of Zp: 
2) Sz E Sy > Sr N Sy = Ø. 
3) Sz #4 Ø forall zes. 


The three properties together describe what is called a partition of a set 9.1! A proper 
understanding of the method used to partition a set S in this way is important in a 
our study of set theory. 

Finest and coarsest partitions. Let S be a non-empty set. The first example below 
illustrates the “finest” partitioning of S. The second example illustrates the “coars- 
est” partitioning of S. 


Examples. 


1) The identity relation, Idg, on S is defined as follows: (x,y) € Ids if and only if 
Su: 


a) 
b) 


c) 


d) 


Show that the relation Idg is indeed an equivalence relation on S. 

If Fas = {Sz : £ E S}, describe Sy for each x € S. 

Compare the class of subsets in Aag to the class of subsets W = {{x}: x € 
S}. How do the elements of @ compare to the elements of Aas? 

In your opinion, is it possible to obtain an even “finer” partition of S? That 
is, is it possible to obtain a partition of S where each Sy is strictly smaller 
than the elements of Aas? 


2) Consider the relation R on the set S defined as follows: (x,y) € R if x and y 
both belong to S. 


a) 
b) 


c) 


d) 


Confirm that the relation R is an equivalence relation on S. 

If Ar = {9x : £x € S}, describe Sy for each x € S. 

Describe the elements of r. Which subsets of S are elements of Sr? List 
all subsets of Zr. Is this true or false: SpR = S. 

In your opinion, is it possible to obtain an even “coarser” partition of S? 
That is, is it possible to obtain a partition of S where each Sy is strictly 
larger than the elements of Sp? 


Concepts review: 


Tf a set Ye = {Sx : £ E€ S} of subsets of S is such that Uses Sx = S we often say “Zr covers S” to 


express this fact. 
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. Given an equivalence relation R on a set S and an element x € S, what does the 


symbol Sy represent? 


. Given an equivalence relation R on a set S what does the symbol Zp represent? 


. Suppose -Z denotes a set of subsets of S. What does it mean to say that partitions 


S? 


. Given an equivalence relation R on a set S what are the essential properties of ZR? 
. What is the “finest” partition of S obtainable by an equivalence relation on S? 


. What is the “coarsest” partition of S obtainable by an equivalence relation on S$? 


EXERCISES 


. Let S be a set and let R and T be two equivalence relations on S. Is the relation 


V = RUT necessarily an equivalence relation on S? Explain. 


. Suppose R is an equivalence relation on a set S and that A C S where A is non- 


empty. We define the relation R4 on A as follows: 
Ra={(x,y): (2,y) € RN (A x A)} 


Show that R4 is an equivalence relation on A. 


. Suppose R is a partial order relation on a set S and that A C S where A is non-empty. 


We define the relation RA on A as follows: 
Ra={(x,y): (ay) € RN (A x A)} 


Show that Ry is a partial order relation on A. 


. Let S be a set and let R and T be two equivalence relations on S. Show that 


V = RAT is an equivalence relation on S. 


. Suppose R and T are two equivalence relations on a set S. For each x € S, let 


RSz = {y : y ES, (x,y) E€ R} and 7S, = {y : y E€ S, (x,y) E€ T}. If, for each x € S, 
RS2 C TSx show that RCT. 


. Suppose R and T are two equivalence relations on a set S. For each x € S, let 


RS; = {y : y E S, (x,y) € R} and 7S, = {y : y E S, (x,y) € T}. If RCT, show 
that for each x € S, pS; C TSz. 
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7. Let S be a set and let R and T be two equivalence relations on S. For each x € S 
let RS, = {y : y E€ S, (x,y) E€ R} and 7S; = {y : y E€ S, (x,y) € T}. Let 
Fr = {RSx : £ E S} and Sr = {rS : x E S}. We have seen that Spr and Yp form 
sets of non-empty subsets of S which are pairwise disjoint and cover all of S. For 
each x € S, let Sy = RS; N 7Sz. Show that Y = {Sz : x € S} forms a set of subsets 
of S satisfying the properties: 


a) Sz # Ø for each z. 
b) Whenever 5S; Æ Sy then Sg N Sy = Ø. 
c) Uzes Sx = S. 
8. Let S and T be sets. It has been shown that“C” constitutes a partial order relation 


on the set Z (S). Consider the set L = P(S) x A(T). We define a relation R on L 
as follows: For (A, B) and (C, D) € A(S) x A(T) 
ACC 
(A,B), (C, D)) E€ Re or 
A=C and BCD 


Show that R is a partial ordering relation on L.! 


This relation is often called the lexicographic ordering of a Cartesian product. 
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8 / Equivalence classes and quotient sets. 


Summary. In this section we continue our discussion of partitions of a set S 
by an equivalence relation. When a set S is partitioned by an equivalence re- 
lation R the subsets in this partition are called “equivalence classes induced by 
R”. These subsets together are called a “quotient set of S induced by R”. We 
then show that any partition of S is induced by some equivalence relation R. 


8.1 More on partitions. 


We have seen that an equivalence relation, R, on a set, S, subdivides S into pairwise 
disjoint subsets that cover all of S. Tools that allow us to systematically partition 
sets into subsets are important in mathematics. In set theory, this can be a method 
to construct a new set from a known set S by partitioning S into smaller pieces and 
forming a new set whose elements are those pieces. We have casually spoken of par- 
titions with an intuitive understanding of what they are. We should formally define 
them before we go on. 


Definition 8.1 Let S be a set. We say that a set of subsets @ C A(S) forms a partition 
of S if @ satisfies the 3 properties: 


1) Uascg 45 
2) If A and BE@ and AFB, then AN B = Ø. 


3) AA Ø forall AEG. 


Based on this definition, we can see that the set of subsets of S, Sr = {S,: x£ E S}, 
formed by the equivalence relation, R, is a partition of the set S.‘ The set, Zr, and 
its elements are referred to in a particular way in various fields of mathematics. We 
formally define these below. 


Definition 8.2 Let S be a set on which an equivalence relation, R, is defined. 


a) Each element S, of Sr = {Sx : x E S} is called an equivalence class of x under R or 
an equivalence class induced by the relation R.? 


‘We can say Zr partitions S. 
?The expression “equivalence class of x modulo R” is sometimes used. 
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b) The set, Yr = {Sz : x E€ S}, of all equivalence classes induced by the relation, R, is 
called the quotient set of S induced by R. The set, Zr, is more commonly represented 
by the symbol, S/R. So 


8.2 


S/R=15,2ee 5} 


From here on we will use the more common notation, S/R. 


Note that if S is a set, then by the Axiom of subset, the equivalence classes in 
S/R = Srp = {Sz : x € S} are in fact “sets”. But if S is a proper class, then it 
may occur that the elements of S/R = {Sz : x E€ S} are equivalence classes which are 
also proper classes. 


Examples of quotient sets induced by an equivalence relation R. 


In the following examples, S and T are non-empty sets. 


a) 


Recall that Idg denotes the identity relation on S: 
(x,y) € Ids if and only if £ = y 
We have seen on page 53 that Ids is an equivalence relation on S. For each x € S, 
the equivalent class of x induced by Id, is 
Ss = {y: (x,y) € Ids} 
= {y:c=y} 
= {x} 
So the quotient set of S induced by Idg is 
S/Idg = Aa, = {{x}: x£ E S} 
The set a, is the largest possible quotient set on S induced by a relation. We 
also say that this quotient set is the “finest” partition of S. 


Let R be the relation defined as follows: (x,y) € R if and only if x and y belong 
to S. Then R= S x S was shown to be an equivalence relation on S. For x € S, 


Sr = fy: ye S,(z,y) € R} 
= {y: ye S$} 
S 
Hence, for every x € S, the equivalence class of x induced by R is Sz = S. So the 
quotient set of S induced by R only contains the element Sy = S and no other. 
Since 
S/R = Sp = {S} 


it is the smallest possible quotient set of S induced by a relation. We also say that 
this quotient set is the “coarsest” partition of S. 
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c) For non-empty sets S and T, let D = S x T. We define a relation, R, on D as 
follows: 


((a, b), (c, d)) € R if and only if a = c 


We have seen on page 53 that R is an equivalence relation on D. Let (a,b) € D. 
Then 


S(a,b) = {(x,y) : (x,y) € D, x= a} 
{(a,y):y ET} 
= {a} xT 


Thus, {a} x T is the equivalence class of (a,b) induced by R. The quotient set of 
D induced by R is 


D/R= 2g ={{xr}xT:xE S} 


8.3 Equivalence relations defined from a partition. 


We have seen how any equivalence relation R on a set S can be used to partition 
S into pairwise disjoint subsets. We will now work the other way around: If we are 
given a partition, @, of S, is there an equivalence relation, R, such that Yr = €? 
We will see if we can construct the required equivalence relation. 


Let S be a set on which we have defined a partition @. 


This means that @ is a class of non-empty pairwise disjoint subsets of S which covers 
all of S. Suppose we define a relation Ry on S in the following way: 


(x,y) € Re if and only if {x,y} C C for some element C in € 


Thus, the only pairs of elements x and y of S which are related under Ry are those 
pairs that appear together in the same subset C € @. Is R¢ an equivalence relation? 


— We verify that R¢ is reflexive: For every x € S, x belongs to some C and so 
{x} = {x,x} C C and so (xz, x) € Re. 

— We verify symmetry of Re: If {x,y} C C € @, then {y,x} C C. So (x,y) € 
Rg => (y, x) E€ Re. 

— We verify transitivity of Ry: If {x,y} C C and {y, z} C C, then {z, z} CC. So 
(x,y) € Reg and (y, z) E€ Re => (a, z) € Re. 


The relation Ry is indeed an equivalence relation on S. We conclude that any parti- 
tion @ of a set S defines an equivalence relation Rg on S. This result deserves to be 
called a theorem. 
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Theorem 8.3 Let S be a set and @ be a partition of S. Let Ry be the relation such that 
(x,y) € Re if and only if {x,y} C C for some element, C, in @. Then Ry is an equiva- 
lence relation on S. 


8.4 Refining an equivalence relation!. 


Let S = {a,b,c,d,e}. Suppose R,T,K,Idg and M are equivalence relations on S$ 
which are defined as follows: 


R = {(a,a), (b,b), (c, c), (d, d), (e,e), (a, b), (b, a), (d, c), (c,d) } 

T = {(a,a), (b,b), (c, c), (d,d), (e, e), (a, b), (b, a), (c, d), (d, c), (c, e), (e, c), (d, e), (e, d)} 
K = {(a,a), (b,b), (c, c), (d, d), (e, e), (a, c), (c, a), (a, b), (b, a), (c, b), (b, c), (d, e), (e, d) 
Ids = {(x,y):x € S,y €S and z = y} = { (a, a), (b, b), (c, c), (d,d), (e,e)} 

M = {(x,y):xrES,yES}J}=SxS 


Verify that R, T, K and M are indeed equivalence relations. (Note that it suffices to 
show that R, T and K partition S.) 

We describe explicitly the elements of SR, ST, SK, Fiag and Sm. (Recall that these 
can also be expressed in the form S/R, S/T, S/K, S/Ids, $/M):? 


Sr = {RSa, Rb, RISc, RSd, RSe} 
= {{a,b, }, {c,d}, {e}} 
Sp = {r Sa, TS, TSc, T Sd, T Se} 
= {{a,b, }, {e,d,e}} 
SK = {r Sa, T Sb, TSc, T Sa, T Se} 
= {{a,b,c}, {d,e}} 
as = {idsdSa, Idg Sb, Idg Sc, Idg Sd, Ids Se} 
= {{a}, {b}, {c}, {d}, {e}} 
fm = {T Sa, TSb, TSc, TSa; T Se} 


= {{a,b,c,d,e}} = {S} 
We now make a few observations: 
a) We see that each pair in R belongs to the relation T. So we can write R C T. 
Similarly, we see that the Ids is a subset of each of R, T and K. But this 


relationship doesn’t hold true between R and K: The pair (c, d) is an element of 
R but not of K. So RỌ K. 


b) Notice how every equivalence class under R is contained in an equivalence class 


under T: 
{a,b,} {a, b, } 
{c,d} {c, d,e} 
{c} {c, d, e} 


1This section can be omitted without loss of continuity. 
Recall that RSs = {y : (x,y) € R} where R is an equivalence relation on S. 
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Similarly, every equivalence class under Idg is a subset of some equivalence class 
under R. The same can be said for the relationship between the equivalence 
classes of Idg and of K. But this relationship between the equivalence classes 
under R and the equivalence classes under K doesn’t hold true. Witness: 


{c,d} Z {a,b,c} 
{c,d} Z {d,e} 


c) In cases such as R and T above, we say that the equivalence relation R refines 
or is a refinement of the relation T. We can make the more general statement: 


An equivalence relation R refines the equivalence relation T whenever 
RCT. 


This is the same as saying “R refines T if every equivalence class under R is 
a subset of some equivalence class under T”. We see that Ids will refine any 
equivalence relation R. We see that R does not refine K since R Z K. Similarly, 
we see that T does not refine K since T Z K. 


d) Also note that no matter which equivalent relation R on S we consider, R C M, 
and so M is refined by any equivalence relation on S. 


Concepts review: 


. Given a set S, what does it mean to say that the class @ of subsets of S partitions S? 


. Given an equivalence relation R on a set S and an element x € S, what is an equiva- 


lence class of x under R? How is it denoted? 


. Given an equivalence relation R on a set S, what is the quotient set of S induced by 


R? How is it denoted? 


. Given an equivalence relation R on a set S what do the expressions pg and S/R 


mean? 


. Given a partition @ of a set S can we define an equivalence relation R on S such that 


S/R=@? 


. What does it mean to say that an equivalence relation R refines the equivalence 


relation T? 


. Is there an equivalence relation on a set S that refines all other equivalence relations? 


. Is there an equivalence relation on a set S that is refined by all other equivalence 


relations? 
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EXERCISES 


1. Suppose R is an equivalence relation on a set S and that A C S where A is non- 
empty. We define the relation R4 on A as follows: 


Ra = {(x,y): (x,y) € RO (A x A)} 


a) Show that R4 is an equivalence relation on A. 


b) If S/R = Yr = {Sz : x € S} represents the quotient set of S induced by R, 
describe the elements of the quotient set, Sra, of A induced by RA. 


2. Let S be a set and let R and T be two equivalence relations on S. 

a) Show that V = RAT is an equivalence relation on S. 

b) If S/R = Fr = {RSx : x E€ S} and S/T = Sr = {rx : £ € S} represent 
the quotient set of S induced by R and induced by T respectively, describe the 
elements of the quotient set of S, Zy, induced by V. 

3. Suppose R and T are two equivalence relations on a set S. For each x € S, let 
S/R = Fr = {RSxz : x E€ S} and S/T = Fr = {rS, : £ € S} represent the quotient 
set of S induced by R and induced by T respectively. If RC T, show that for each 
res, 

TS. = U RSy 


YeTSz 


4. Let R and T be two equivalence relations on a set S. If R C T we say that R is finer 
than T (or is a refinement of T). The choice of these expressions when comparing 
two equivalence relations is suggested by the result described in problem 3: The 
quotient sets of a finer equivalence relation all seem to fit neatly inside the quotient 
sets of a coarser equivalence relation. Give the finest possible equivalence relation 
on S. Give the coarsest possible equivalence relation on S. 

5. Let S = {a,b,c,d,e, f} be a set. Suppose R and T are equivalence relations on S 
defined as follows: 


R Ids U {(a, b), (b,a), (b, c), (c, b), (a,c), (c, a), (d, c), (c, d)} 
T = IdsU{(b,c),(c,b)} 


Write out explicitly the elements of the sets Zr and Fr. 


6. Let S be a set. Let R and T be two equivalence relations on S where R C T 
(that is R is finer than T). For each x € S, let S/R = Fr = {RSx : £ € S} and 
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S/T = Sp = {rS,: x E€ S} represent the quotient set of S induced by R and induced 
by T respectively. We define the quotient of T by R, denoted T/R, as follows: 


T/R = {(RSz; RSy) : (x,y) € TH 


From this definition we see that T/R is a relation on S/R = Zp. That is pS, and 
RSy are related under T/S if and only if x and y are related under T. 


a) Show that T/R is an equivalence relation on S/R = Sp. 

b) Let K be another equivalence relation on S where R C T C K. Show that 
T/GCK/R. 

c) Referring to the example on page 71 write out explicitly the elements of the 
equivalence relation T/R on Jp. 

d) Referring to the set S described in question 5 above, write out explicitly the 
elements of the equivalence relation T/R on Sp. 
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9 / Functions: A set-theoretic definition. 


9.1 


Summary. In this section we formally define what we mean by a “function”. 
This is done using only the set-theoretic concepts developed up to now. We 
introduce notation to simplify the discussion and definition of these concepts. 
Examples of simple functions such as the identity function, the characteristic 
function and constant functions are presented. Given a function, f, on a set A, 
we define the restriction of this function on a subset, D, of A. We state what we 
mean by “equal functions”. The expressions “one-to-one”, “injective”, “onto”, 
“surjective”, “bijective functions” are also defined. 


A set-theoretic definition. 


The concept of a function is not new to most readers. A standard definition goes 
something like this: “Given two sets A and B, a function is a rule, f, which associates 
to each element of A a single element of B”. 


To construct a function, first a property involving two elements x, y is defined. Say 
we represent this property by (x,y). The property ¢ is the blueprint which is used 
to construct a subset 


f={(x,y)€ Ax B: x, y satisfies the property (x, y)} 


of the Cartesian product A x B. So ¢ is the tool used to distinguish those ordered 
pairs (x, y) which belong to f from those that don’t. This defines a relation, f. If it 
can be shown that “(z,y) and (x, z) belong to f implies y = z”, then f is called a 
function. Since a function f is a subclass of a Cartesian product of sets, then functions 
are, by definition, sets. In practice, users often do not distinguish between the rule ¢ 
and the set f whose elements are determined by it (even though ¢ is just a formula, 
while f is a well-defined set and so is governed by axioms associated with sets). For 
example, let A = {1,2} and B = {2, 3, 4}; define the rule ¢ as “the second element is 
twice the first”. The set which follows from ¢ is, f = {(1, 2), (2,4)} c A x B. Or, we 
could write, f = {(z,y): 2 € A, G(x, y) is satisfied} C A x B. For practical reasons, 
we normally just use the symbol f to represent both the rule and the set which flows 
from it. Opportunities to say more about the notion of a function abound in the 
following chapters in this text. 


Our objective will be to define the concept of a function within the ZFC-universe, 
without adding any new primitive concepts to the three we already have: “class”, 
“set”, “belongs to”. We must formulate this definition carefully so that it represents 
precisely what we want and understand it to be. 


For most readers the notion of a “function” is intrinsically linked to those sets we call 
“numbers”: natural numbers, integers, rational numbers and real numbers. So it may 
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appear strange to discuss the concept of a function before describing what numbers 
actually are in set theory. We have not yet shown how numbers can be constructed 
using our ZFC axioms. This is to come. In what follows, we will see that func- 
tions exist independently of numbers. Functions can be defined in terms of abstract 
sets. Studying functions in the absence of numbers will allow readers to better see, in 
essence, what they truly are. 


Definition 9.1 A function f mapping elements from a set A into a set B is a triple (f, A, B) 
satisfying the following properties!: 


1) fCAxB 


2) For every a € A there exists b € B such that (a,b) € f.? 


3) If (a,b) € f and (a,c) € f, then b = c. Equivalently, if (a,b) € f and (c,d) € f, then 


9.2 


(b#d) = (a#o).) 


From this definition we see that a function f C A x B is a special type of relation on 
AUB with dom f C A andim f C B. A function f can also be viewed as a particular 
element of P(A x B). 


Commonly used notation when discussing functions. 


There is no reason why we should adopt functional notation which is different from 
the one we are accustomed to. We should however explain carefully how this notation 
is to be interpreted in set theory. 


— Rather than represent a function as (f, A, B) we will write 
f:A-B 


and say “f maps elements of A into B”. When we write, “f : A — B”, it will 
always be understood that A and B are sets. 


— If (x,y) € f, we will write 
f(t) =y 
and say that y is the image of x under f. 
— We will also say that x is a preimage or an inverse image of the element y. 


'By “a triple (f, A,B)” we mean that a function is characterized by three sets f , A and B with the 
described properties. 

Using logical symbols: Vz € A Jb € B| (a,b) € f 

3Using logical symbols: [(a,b) € f] A [(a,c) € f| > (b=). 





Part IV: Functions 79 


9.3 


The definition of a function, f, states that f is a subset of A x B and therefore is a 
relation. If A = B it is a relation on A with the extra condition: “f(a) 4 f(b) => a # 
b”. Since a function is a relation we can then speak of its domain and its image. 


— From the definition of “function”, we see that for every x € A, there is some y € B 
such that (x,y) € Ax B. So, by definition of the domain of a relation (see definition 
5.3), 

A= dom f 


— It may be that not every y € B is such that (x,y) € f for some x in A. So we must 
be clear about what we mean by the image of f: 


im f = {y E€ B : (x,y) E€ f for some x € dom f} 


If A is the domain of the function f C Ax B, then we will express the image, im f, 
of A under f as 
im f = f[A] 


— Note that im f is contained in B and need not be equal to B. To distinguish 
between f[{A] and B we will refer to B as being the codomain of A, abbreviated as 
codom f. The words “range of f”, denoted as ran f, is often used instead of “image 
of f”. In such cases you will read 


ran f = {Al 


Restricting a function to a subset of its domain. 


Suppose we are given a function, f : A — B. Then f is mapping each element in its 
domain A into B. If D C A, then we may restrict the domain of f so that it only acts 
on the elements of D. We will show that f : D — C is also a function: 


Since f : A— B isa function then by definition, for every x € A, there exists y € B 
such that f(x) = y. Since D C A then for every x € D, x € A; hence there exists 
y € B such that f(x) = y. Since the image (under f) of every x € A is unique, then 
the same is true for every z € D C A. Thus, by definition, f : D — B is a function. 


Notation to express the restriction of a function f to a subset of its domain will be 
useful. We introduce this now. 


Definition 9.2 If f : A — B is a function and D C A, then we say that the function 
f :D— Cis a restriction of f to D. In this case we will use the symbol, f|p, to represent 
the restriction of f to D. Note that if D C A, then we can write, f|p C f, since 


flo ={(2,y):@ € D and (x,y) E f} Cf 
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We will now see that a function, f : A — B, can always be expressed as the union of 
two functions, provided its domain contains more than one element. 


Theorem 9.3 Let f : A— B be a function and suppose A = C'U D, where neither C nor 
D is empty. Then f = flc U flp. 


Proof: 
Given: A function f : A — B is defined and A = C U D. 
f = {(2,y): (x,y) € Ax Band (x,y) € f} 
= {(x,y): (x,y) € (Ax B)N f} 
= {(z,y):(@,y) € (CUD) x BN f} 
= {(2,y): (x,y) €[(C x B)U(D x B) | Of} (theorem 4.7) 
= {(x,y):(x,y)E[ (Cx B)Nf] U [ (Dx B)A f J} (theorem 3.9) 
= {(2,y): (zy) € (Cx B)N f} 
U {(x,y) : (x,y) € (D x B)N f} 
= {(x,y): (x,y) € (C x B) and (x,y) € Nf} 


U {(x,y) : (x,y) € (D x B) and (z, y) E€ Nf} 
fle U fip 


9.4 Equal functions. 


We know that two sets are equal provided both sets contain the same elements. Since 
functions are defined as being sets of ordered pairs, then we can establish equality of 
two functions by comparing the elements of the sets they represent. If the function f 
and g contain the same ordered pairs, then we can write f = g. 


Theorem 9.4 Two functions f : A — B and g : A —> B are equal if and only if f(x) = g(x) 
for all x € A. 


Proof: 
f=9 & Forany z€ A, (x,y) € f if and only if (x,y) € g 
< ForanyzeE€ A, f(x) =yand g(x) =y 
<= For any z€ A, f(x)=g(x) 
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9.5 Some particular types of functions. 


We present a few elementary functions with particular properties often encountered 
in various fields of mathematics. 


Definition 9.5 Let f : A — B be a function. 


a) We say that “f maps A onto B” if im f = B. We often use the expression “f : A —> B 
is surjective” instead of the word onto. 


b 


wn 


We say that “f maps A one-to-one into B” if, whenever f(x) = f(y), then z = y. We 
often use the expression “f : A — B is injective” instead of the words one-to-one into 
B. 


c) If the function f : A — B is both one-to-one and onto B we then say that f is “one- 
to-one and onto”. Another way of conveying this is to say that f is bijective, or f isa 
bijection. So “injective + surjective = bijective”. 

d) Two classes (or sets) A and B for which there exists some bijective function f : A— B 
are said to be in one-to-one correspondence. 


9.6 A few examples of simple functions. 


a) The constant function. Let A and B be two sets and suppose b € B. Define the 
function f : A — B as follows: 


f(z) =b for all z € A 


This function maps all elements of A to the same element of B. We call this a 

constant function. 

— If BF {b}, then f is not “onto B” or surjective; it is just “into” B. 

— However, if we were to write, f : A — {b} of course we could say that f is 
surjective. 

— If A has only one element, say A = {a}, then f : {a} — {b} is a constant 
function which is bijective. 


b) The characteristic function. Let C be the 2 element set C = {@,{@}}.1 Let A 
be a set and D be a non-empty subset of A. We define a function denoted by 
Xp : A > C as follows?: 


‘We should justify that C is a set: Since Ø is a subclass of any set, then by the Axiom A4 (Axiom of 
subset), Ø is a set. Also {Ø} is a set since {Ø, Ø} is a set (by the axiom of pair). Again by the axiom of 
pair, {@,{@}} is a set. 

?The Greek letter x is pronounced “kie” (like the word “pie”). 
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JØ if x€D 
oe ee if «eD 


This is called the characteristic function of D in A. 
— We see that x, is “onto C”. 
— The characteristic function is constant on D mapping all elements of D to the 
single element {Ø}. It is constant on A — D mapping all elements of A — D to 
the single element Ø. 


— We can write 


Xp = {(x,Ø):x E€ A—-D}U {(a,{@}): 2 E€ D}} 
(Xp) |A-p U (xp) |p 


c) Recall that in theorem 4.9, we showed that the elements of the two classes A x 
(B x C) could be matched one-to-one with the elements of (A x B) x C. The proof 
of this theorem shows that the function f : A x (B x C) — (A x B) x C defined 
as f((a, (b,c))) = (a, (b, c)) is a bijection between these two classes. 


Class functions. 


A “function” was formally defined as being a particular kind of subset of the Cartesian 
product of two sets. Suppose that X and Y are classes (possibly proper) and f is a 
subclass of X x Y which satisfies the conditions given in definition 9.1. In order to 
distinguish f from the notion of “function” as presented in definition 9.1, we will refer 
to f as a class function, keeping in mind that f may be a proper class. 


Concepts review: 


. What is the definition of a function f from a set A to a set B? 
. Is it acceptable to view a function f from a set A to a set B as a set of ordered pairs? 


. Given a function f from a set A to a set B, what do each of the sets dom f, codom f, 


im f and ran f represent? 


. Given a function f from a set A to a set B and y € im f, what is the preimage or 


inverse of y? 


. Given a function f from a set A to a set B and a set D such that D C A what does 


the symbol f|p mean? 


. Given a function f from a set A to a set B where A = C U D is it true that f = 


fic U flo? 
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7. Given the two functions f : A > B and g : A— B what does it mean to say that the 
functions f and g are equal? How can we show that f = g? 
8. Given a function f from a set A to a set B what does it mean to say that f is onto 
B? 
9. Given a function f from a set A to a set B what does it mean to say that f is 
one-to-one into B? 
10. Given a function f from a set A to a set B what does it mean to say that f is injective? 
11. Given a function f from a set A to a set B what does it mean to say that f is 
surjective? 
12. Given a function f from a set A to a set B what does it mean to say that f is 
one-to-one and onto B? 
13. Given a function f from a set A to a set B what does it mean to say that f is bijective 
(or f is a bijection)? 
14. Given a function f from a set A to a set B is it safe to say that if f is both injective 
and surjective, then it is bijective? 
15. Given two sets A and B what does it mean to say that there is a one-to-one corre- 
spondence between A and B? 
16. If D is a subset of A what is the characteristic function y|p of D in A? Describe y|p 
as a set of ordered pairs. 
EXERCISES 


1. Suppose A is a set. Show that the set {(x, x): x € A} is a function. 
. Let f : A — B be a function. Show that if g C f and g is non-empty, then g isa 


function. 


. Suppose f : A— B and g : A — B are two functions each of which has the set A as 


domain. If f C g show that f = g. 


. If C is as set, let f : C — Ic be defined as f(a) = (a,a) for all a € C 


a) Show that f satisfies the definition of a function. 
b) Show that f is one-to-one and onto and therefore is bijective 
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. Let D and E be two sets such that DN E = Ø. Let g: D— B and h : E— B be 


two functions. Let f = gUh. 
a) Show that f : DU E — B is a function. 
b) Show that f|p = g and f|z =h. 


. Let f: A— Band g : C — D be two functions. We define (f xg): Ax C —> Bx D 


as follows: 
(F x g)((x,y)) = (f(a), g(x) for all (x,y) € A x C 
a) Show that (f x g): Ax C —> Bx D is a function. 


b) Show that if f : A — B and g : C — Dare bijective, then (fxg): AxC > BxD 
is bijective. 


. Let f : A — B and g : C — D be two bijective functions where AN C = Ø and 


BAD= Ø. Let h: AUC — BUD be defined as follows: 


_ f f(x) forall zeA 
he) { g(x) forall rEC 


a) Show that h : AUC > BU Disa function. 
b) Show that if f : A— B and g : C — D are bijective, then h: AUC — BUD is 
bijective. 


. Let S and T be sets and f be a function on S x T defined as: f((x,y)) =x for all 


(x,y)E Sx T. 

a) Verify that f is indeed a function. 

b) Describe the image of f. 

c) Verify whether f is one-to-one or not. If it is prove it, if it isn’t show why not. 


. Let A be a set and D C A. Recall that x|p is the characteristic function mapping x 


to {Ø} if x € D and z to Ø if x g D. Show that im x|p is a set. 


Let f: A— Band g : C — D be two bijective functions. Leth: AUC => BUD 
be defined as follows: 


_f f(x) forall zeA 
a= { g(x) forall zeC 


a) Is h necessarily a function? If it isn’t, give an example illustrating this. 

b) If h is a function is h necessarily bijective? If it isn’t give an example illustrating 
this. 

Let f : A — B be a function. Show that if g C f, then there exists some subset C 

of A such that fle =g. 

Is @ a one-to-one function? Explain. 
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10 / Operations on functions. 


Summary. In this section we define the composition, gof, of two functions 
f:A— Bandg: B— C. We view “composition of functions” as an operation 
“o” on two functions f and g. From this perspective we then discuss the main 
properties of composition of functions (such as non-commutativity and associa- 
tivity). It is in this particular context that we describe the identity function and 
the inverse of a function. We also define the concept of “invertible function”. 


10.1 Composition of functions: a set-theoretic definition. 


Suppose f : A — B and g : B — C are two functions. A noticeable fact about 
these two functions is that the domain of the function g is the codomain of f. So for 
x € A we have f(x) € domg. For such an element x € A, the expression g(f()) is 
well-defined. This allows us to construct the set: 


h={(z,y): ce A,y=g(f(x))€ img} CAxC 


By the Axiom of construction A2, h is a well-defined subset of A x C. With these 
thoughts in mind, we formally define this notion of “composition of two functions”. 


Definition 10.1 Suppose f : A — B and g : B — C are two functions such that the 
codomain of the function f is the domain of the function g. Let 


h = {(x,z) € Ax C : y = f(z) and z = gly) =9(f(z)) } 


Thus, (x, z) € h if and only if (x, z) = (x,g(f(x)). We will call h the composition of g and 
f, and denote it by gof where (gof)(x) = g(f(£)). 


Given the functions f : A — B and g : B — C and seeing that gof C A x C we 
naturally suspect that gof : A — C is a function. We will, of course, have to make 
sure that this is the case. 


Theorem 10.2 Let f: A — B and g : B — C be two functions such that the codomain 
of the function f is the domain of the function g. Then the composition of g and f, 
(gof): A — C, is a function. 


Proof: 


Given: f : A — B and g : B — C are two functions. 
We are required to show that gof is a function. 
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1) By definition of h = gof, 
h=gfCAxC 


2) Let x € A. We are required to show that h(x) € C. 


zEA => f(x) € B (Sinceimf cB) 
= f(x) 

=> g9(f( 

(x) 


> h TIE C (Since g(f(x)) = h(«)) 


E€ dom g (Since im f C domg) 


x)) € C (Since g : B — C is a function ) 


Thus, x € A => h(x) = (gef)(x) € C. 
3) Suppose h(a) = g(f(a)) # g(f (b)) = h(b). 


h(a) =g9(f(a)) FA g(f(b)) = h(b) 
= f(a) # fd) 
>a Æ b 


The three conditions being satisfied, we conclude that gof is a function. 


10.2 Composition of functions viewed as an operation on functions. 


Given two functions f : A — B and g : B — C, we have shown that we can associate 
with this pair of functions another function h = gof called “the composition of f and 
g” X 

This suggests that “composition”, denoted by the symbol, °, can be viewed as an op- 
eration on pairs of functions, just like x, U, M and +. We wonder: 


1) Can we compose any pair of functions? 


The definition of composition of functions makes it quite clear that we can’t 
compose certain pairs of functions. For the composition gof of two functions f 
and g to be well-defined the image of f must be in the domain of g. 


2) Is the composition of functions commutative? 


Again, it is clear from the definition of composition of functions that we can’t 
commute certain pairs of functions with respect to composition. 


— Suppose for example that we can compose the functions f : A — B and 
g:C — D in this order: fog. 

— Then, this means that dom f C img. But if domg is not contained in the 
im f, the expression, gof, is not meaningful; so we cannot commute the pair 


f and g. 
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3) Is the composition of functions associative? 


Yes. The following theorem shows that the composition of functions satisfies the 


associative property. 


Theorem 10.3 Let f: A— B,g:B 
ho(gof) = (heg)ef. 


Proof: It suffices to show that ho(gof) C 


Proof of ho(gof) C (heog)of: 


— C and h : C — D be three functions. Then 


(hog)of and (hog)of C ho(gof). 


(x,y) € ho(gof) => [gf] (z) = y 

= hig(f(2))=y 

= h(z)=y for some z € domh C C 

= g(f(z))=zEC 

= g(u) =z for some u € domg C B 

=> f(x)=ueB 

((hog)of)(@) = (heg)(F(x)) = (hog) (u) 
= h(g(u)) 
= h(z) 
= y 
((heg)of)(t)=y => (z,y) € (heg)of 
We have shown that ho(gof) C (heg)of. 


To show (heg)of C ho(gof) we proceed similarly. This is left as an exercise. 


An identity element for the operatio 


n “co”, Is there a function J such that any function 


g composed with J will leave that function unchanged, i.e., gol = Iog = g? 


An obvious candidate for an identit 


y element with respect to composition is the iden- 


tity relation, 1: Y — Y.* It is defined as, I(x) = x, for all x € X. If A is a set, then 
I, will denote the restriction of J to A and so I4 : A — A is defined as [4(x) = x, for 
all x € A. The following theorem confirms that this function behaves as expected. 


Theorem 10.4 Let f: A— B. Then Ipof = f and fol, = f. 


TRecall that Y denotes the class of all elements and is called the Universal class. 
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Proof: 


Given: f : A— B and Ig(x) = z for all z € B. 


(x,y) € Igof < (ax,z) € fand (z,y) € Ip for some z € B 
< (x,z)€ f and z = y for some z € B 
S (z,y)Ef 
Thus, Ipof = f. 
The proof of fol4 = f is similar. It is left to the reader. 


o ” 


Inverses with respect to and the identity I. Once an identity element J has been 
identified, one naturally wonders whether certain functions f have an inverse g with 
respect to this identity so that gof = I? 


We will show that only certain functions have an “inverse” with respect to “composi- 
tion”. 


Definition 10.5 Let f: A — B. If g: B — Aisa function satisfying gof = IA, then we 
will call g an “inverse of f”; we represent g as f~!. 


Theorem 10.6 Let f : A — B be a one-to-one onto function. 


a) An inverse function, f7} : B —> A, of f exists. 

b) The function, f~!, is one-to-one and onto. 

c) The function, f : A — B, is the inverse of f~! : B > A. That is, (f~')~! = f. 
) 


d) The inverse function, f7}, of f is unique. 
Proof: 


What we are given for parts a) to d): That f : A — B is a one-to-one onto function. 


a) What we are required to show: That there exists a function g such that g(f(x)) = z. 
This function g will be f7!. 


Define g : B > A as follows: g(x) = y only if f(y) = x. We claim that g: B — A is 
a well-defined function: 

— Let x € B. Since f is onto B, then there exists y € A such that f(y) = x. Thus, 
dom g = B. Suppose now that (x,y) and (x, z) are in g. Then y and z are in A 
such that f(y) = x and f(z) = x. Since f is one-to-one, then y and z must be 
the same element. Thus, g : B — A is a well-defined function. 
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Then g satisfies the definition of an inverse of f, g(f(x)) = x, and so gof = I4. Thus, 
g= f. 


b) What we are required to show: That f~! is one-to-one: 
Suppose (x,y) and (z, y) both belong to f~! : B — A. Then f(y) = x and f(y) = z. 
Since f : A > B is a function, then x = z. Thus, f~! is one-to-one on its domain as 
claimed. 


c) What we are required to show: If f7! : B— A and f : A— B, then f.f~! = Ip: 
We are assuming that A = im f7! and A = dom f. Suppose fof™t(x) = z. Then 
there is some y € A such that (x,y) € fT! and (y,z) € f. But since f~! is the 
inverse of f, (x,y) € f7! implies (y, x) € f. Since both (y, x) and (y, z) belong to f, 
then z = x. Then f.f~!(x) = x for all x € B. We conclude that f is an inverse of f71. 


d) What we are required to show: If h : B — A is a function satisfying hof = I4, then 
h can only be f~t. 





hof =I = ff => (hof) f = (ff 





= ho( fof 1) =f (fof £) (Associativity) 
=> hg =f tIp 
o ee 


This theorem confirms that: 
— If f is one-to-one on its domain, then f has an inverse f~t 
— This function f~! is unique and is one-to-one. 


Conversely, if f : A — B has an inverse f~!, then for a,b € A 


fla) A f(b) => Fa) =a and f-*(f(6)) =6 


> a x b (Otherwise f maps a = b to distinct points f(a) and f(b).) 
so the function f must be one-to-one. 


We have shown that “f has an inverse if and only if f is one-to-one”. 


Definition 10.7 Invertible functions, f : A — B, on A are precisely the one-to-one func- 
tions on A. 
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10.3 The inverse of the composition of functions. 


Suppose we are given the two functions, f : A — B and g : B — C, both of which are 
one-to-one and onto functions. The following theorem shows us how to proceed when 
we wish to find the inverse of their composition, (gof)~!. 


Theorem 10.8 Let f : A — B and g : B — C be two one-to-one and onto functions. 


a) Then the function, (gof) : A — C, is also one-to-one and onto C. 


b) Then the inverse of gof, is (gof)7t = flog! 


Proof: The proofs of these statements are left as an exercise. 


10.4 Comparing the inverse of a function to the inverse of a relation. 


Recall that “inverses” were discussed before we defined the notion of a function (see 
definition 5.4). We referred to inverses while studying relations and some of their 
properties. We pause to compare the inverse of a relation to the inverse of a function 
so that we can better see how they are similar and how they differ. 


— Relations. Given sets A and B, any subset of {(a,b) : (a,b) E€ Ax B} isa 


relation. No other conditions are specified. For any relation R C A x B we can 
construct another relation, R7}, called its inverse. This inverse is defined as: 
R! = {(y, x): (z,y) € R}. 

Functions. A function f : A > B is a set of ordered pairs { (a,b) : a € A, b = 
f(a) € B} and so is a relation. But those relations we call “functions” must 
satisfy the condition “[(a,b) = (a,c)] = [b = c|”. We have declared that a 
function is an invertible function only if f is one-to-one. But when viewing f as 
a relation (a subset of A x B) we can speak of its “inverse”, as a relation, even 
though it is not one-to-one. That is, if f = {(x, y) : y = f(x)} the inverse, f7}, 
of f is 


f ={(y,2):y = f(a), € dom f} = {(y, 2) : (x,y) € f} 


There is no contradiction here. But we should be more specific by saying: If f is 
not one-to-one we can speak of its inverse f—!, with the caveat that f~! cannot 
be referred to as a “function”. So, when we say that “f is invertible if and only 
if it is one-to-one”, we actually mean “the inverse f~! of the function, f, is a 
function if and only if this function f is one-to-one”. 
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We consider the following example. Let A be a set, {a,b} C A where a Æ b and U be 
a non-empty subset of A. Consider the function f : A — A defined as follows: 


a if xeU 
ra= if sgU 


Then f can be described as 
f=[|U x {a}] U [(A-U)x {b} 


Its inverse 


f° =[{a} x U] U Ho}x(4A-0) 


can only be referred to as a relation on A, unless of course both U and A — U are 
singleton sets. 


Concepts review: 


1. Given two functions f : A — B and g : B — C what does the expression “gof” mean? 
Under what conditions does this expression make sense? 

2. Is the composition of functions commutative? Are there any pairs of functions which 
always “commute” with each other? 

3. Under what condition(s) is the composition of functions associative? 

4. Which function plays the role of the identity with respect to “°”? 

5. What does it mean to say that a function f is “invertible”? 

6. Under what condition(s) is a function invertible with respect to “°”? 

7. If a function h can be expressed as h = gof where both f and g are invertible, is A 
invertible? If so, how can we express h~!? 

8. If f is not one-to-one on its domain what interpretation can we give to the expression 
fal 

EXERCISES 


A. 


1. Suppose f : A — B and g : B > C are functions and D C A. Prove that (gof)|p = 
ge(f |p): 
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. Let f: A — Band g : B — C be two functions. 


a) Prove that if (gof) : A — C is one-to-one, then f : A — B is one-to-one. 
b) Prove that if (gof) : A — C is onto C, then g is onto C. 


. Let g: B — C and h : B — C be two functions. Suppose gof = hof for every 


function f : A — B. Prove that g = h. 


. Let g: A — B and h : A — B be two functions and let C be a set with more than 


one element. Prove that if fog = foh for every function f : B — C, then g =h. 


. Prove the statements a) and b) of theorem 10.8. 
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11 / Images and preimages of sets. 


Summary. Suppose we are given a function f : A — B. In this section we “el- 
evate” this function so that it acts on P(A), mapping its elements to elements 
of P(B) according to the rule determined by f. This provides a mechanism by 
which we can study the image of sets under the function f. We also define the 
set-valued inverse of a function, f€, and provide some examples. We also show 
how such functions act on unions and intersections of sets. 


Functions mapping sets to sets. 
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The domain of all functions in this section are hypothesized to be sets. Given a func- 
tion, f : A — B, it is sometimes useful to see what effect the function has on subsets 
of the domain rather than simply on its elements. To study the action of functions 


on sets we will introduce some special notation. 


Definition 11.1 Suppose f is a relation with the set A as domain and the set B as range. 
If S is a subset of A, then we will represent the image of the set S under f as 


IS] ={y € B: (x,y) € f and z € S} 


If U C B, we will refer to the set 


f [U] = {x € A:(xz,y) € f andy €U} 


as the preimage of the set U under f. 


Remarks. What is new in this definition? 


— First observe that the expression f[S] — notice the square brackets — is the 


image of S under f. So the square brackets are not there just as a matter of 
style. They have meaning. The function, f| ], associates elements of A(A) to 
elements of Y(B) where f is predefined either as a relation or, more specifically, 
as a function. 


The symbol, “f4”, and the words, preimage of a set, are new. If f is a function, 
then f“ [U] is simply the image of U under the relation f~!. Again notice the 
square brackets which means we are associating sets to sets, an association gov- 
erned by f*.! (If f is a function and x € im f, in some branches of mathematics 


‘Note that use of the notation f[U] is not universal. It is introduced here to avoid confusing the 
preimage of an element, f~'(x), normally used with one-to-one functions, with the preimage, f~'[U], of a 
set U. A general topologist might refer to “B = f“ [U]” by saying that “f pulls back the set U to the set 


BY. 
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f[{z}] is referred to as the fiber of x under the function f.) 


Examples. 


1) 


Let B = {a,b,c} and U be a non-empty proper subset of A. Consider the 
function f : A — B defined as follows: 


a if xEeEU 
fea) ={ 5 if x¢U 
Then 


fU]={a} and fo[{a}] =U 
f[A-U]={b} and f [{b}]=A-U 
fl@]={c} and fH} =Z 


We see that the preimage of {c} is empty since f maps no elements of A to c. 
This is another way of saying that c is not in the range of f, or, c ¢ f[A]. 
Suppose f : A — P(A) is defined as f(x) = {x}. If {a} F {b}, then a Æ b, hence, 
in this case, f is a function. If B C A, then f[B] = {{z}: 2 € B} C P(A). In 
this case f maps the element B € Y(A) to the element f[B] € A(A(A)). For 
x,y E€ A, FO Hizh {uJ} = {2y} 


Let A = {a,b,c,d,e,k,h} and B = {u,v,w,z,s} and let D = {e,k,h} and 
E = {c,d}. 

We define f = {(a, u), (b, u), (c, u), (d, v), (e, v), (k, z), (h,s)}. We describe the 
function f via images and preimages. 


fHu} = {a,b,c} 
f- Ho} {d,e} 
F- K2 {k} 
flolD] = f|pl{e, k, h} {v, 2, s} 
(flo) [tek] = {e} 
(fle\[E] = (flee, d} {u,v} 
(Fle) Hu} = {c} 








11.2 Images and preimages of unions and intersections of sets. 


Since we have defined how a function f : A — B can be elevated to 

f: P(A) — A(B), mapping sets to sets, we can determine how such functions behave 
when f acts on unions and intersections of sets (or classes). We present these few 
properties in the form of a theorem. While reading through these properties, we will 
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yl 


see that f€ always “respects” unions and intersections. The function f will be seen 


to “respect” unions; but f will “respect” intersections only in certain circumstances. 


Theorem 11.2 Let f : A— B bea function mapping the set A to the set B. Let & be a 
set of subsets of A and Z be a set of subsets of B. Let DC Aand EC B. Then: 


a) f [sew S| = Usew f [S] 


b) f [Asew S] © Nsew f [S] where equality holds true only if f is one-to-one. 
c) f[A—D] C B — f[D]. Equality holds true only if f is one-to-one and onto B.? 
d) Am lUseg S] = Usez f= [sS] 
e) fT [Nsez s] = Nsez fT [S] 
f) f7 [B - E] = A- f“ [E] 
Proof: 
a) res|U s| < x = f(y) for some yE |) S 
SEA SEA 
= x= f(y) for some y in some S € & 
= x= f(y) € f[S] for some SE & 
e xe Fis 


SEA 
b) It will be helpful to first prove this statement for the intersection of only two sets 
U and V. The use of a Venn diagram will also help visualize what is happening. 


So we first prove the statement: f[U O V] C f[U] A f[V] with equality only if f 
is one-to-one on U U V. 


Case 1: We consider the case where U N V = Ø. 
Then f [UNV] = Ø C f[U] N f[V]. So the statement holds true. 


Case 2: We now consider the case where U N V # Ø. 
xe f[lUnvVv] = oo ) for some ye UNV 


< f(y) for some y contained in both U and V 
=> f(y) € f[U] and f[V] 
S ref € fU 1n fiV] 


‘We say that f respects unions if it is always true that f[A U B] = f[A]U f[B]. Similarly, f respects 
intersections if it is always true that f[AN B] = f[A]N f[B]. 
?Remember that A — B = ANB’ equals A intersection the complement of B. 
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We now show that if f is one-to-one on UUV, then f[U] A f[V] C f [U A V] and 
so equality holds true. 


— Suppose x = f(y) € f[U] N f[V]. Then there exists u € U and v € V such 
that f(u) = f(v) = f(y). Since f is one-to-one, u = v = y. This implies 
y€ UNV. Hence, f[(UNV] = f[U] Aa f[V]. 


The proof of the general statement is left as an exercise. 


Proof is left as an exercise. 


ce fr | U s| = x= f(y) for some y € U S (By definition of f7.) 
SEZ SEZ 
= x= f(y) for some y in some SE Z 


= xef Hy} C fols] for some Se & 
s «eV fis] 
SEB 
Thus, f~ (Usea 9) =Usea FT (9). 
Proof is left as an exercise. 


Proof is left as an exercise. 


Concepts review: 


1. Given a function f : A— B and S C A, what does the expression f |S] mean? 


2. Given a function f : A— B and S C B what does the expression f“ [S] mean? 


3. What is the preimage of a set S under a function f? 


4. Given a function f : A — B and x € B — im f, what is f€ ({x})? 


5. Under what conditions does f respect unions? 


6. Under what conditions does f respect intersections? 


7. Under what conditions does f“ respect unions? 


8. Under what conditions does f*~ respect intersections? 
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EXERCISES 


1. 


CON ON 


10. 


11. 


12. 


Give an example of a function f : A — B where A contains two unequal non-empty 
subsets D and E satisfying f| D] = f[E]. 


. Let f : A — B be a function where A and B are sets. 


a) Prove that if D and E are equal subsets of A, then f[D] = f[E]. 
b) Prove that if U and V are equal subsets of B, then f€ [U] = f* [V]. 


. Suppose f : A — B where A and B are sets. 


a) Show that for any subset D of A, D C f“ (f[D]). 

b) Give an example where D # f“ (f[D]). 

b) Show that for any subset F of B, f[f~[E]] C E. 

d) Is it necessarily true that f[f[E]] = E? 

e) Prove that if f is one-to-one on A, then, for any subset D of A, D = f~[f[D]]. 
f) Prove that if f is onto B, then, for any subset E of B, f[f [E]] = E. 


. Let S and T be sets and f : SxT — S bea function on SxT defined as: f((x, y)) =z 


for all (x,y) € S xT. 
a) If u € im f what is f€ Hu}? 
b) If U is a non-empty subset of S what is f“ [U]? 


. Prove the general case of part b) of theorem 11.2. 
. Prove part c) of theorem 11.2. 
. Prove part e) of theorem 11.2. 
. Prove part f) of theorem 11.2. 


. Let f : A — B be a function mapping the set A to the set B. Prove that if D C A, 


then 

FF (FDI = FID] 
Let f : A — B be a function which maps the set A onto the set B. Prove that 

fB)= U fle 

ceEB 
Let f : A— B be a function which maps the set A onto the set B. Prove that if x 
and y are distinct elements of B, then 
Feo f Hy} = 2 


Let f : A — B be a function which maps the set A onto the set B. Prove that the 
set of sets 


F = {f Hz}: xE B} 


forms a partition of the set A. 
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12 / Equivalence relations induced by functions. 


12.1 


Summary. In this section we show how a function f : S — T partitions its 
domain S. This partition induces an equivalence relation Rs on S which in turn 
leads to the quotient set S/Ry. We then present a theorem which shows how any 
function can be expressed as the composition of two functions, neither of which 
is f itself or the identity function. 


Partitioning the domain of a function f : A — B. 


Suppose f : A — B is a function which maps a set A into a set B. 
We claim that the set {f Hx} : x € f[A]} C A(A) forms a partition of A. 


— Since every x € f[A] is in the image of f, f~ Hzr} is non-empty for all x € f[A]. 


— If x Fy, f ({{xr}) N f ({y}) = Ø otherwise an element z E€ f~({x})N fT Uy} 
would be mapped to distinct points, contradicting the fact that f is a function. 


— Finally, the function f~ sends the image, f |A], of A back to A, i.e., f~[f[A]] = A. 


So A = f [FLA] = f Usesta) = Ure sta fry). Hence, {f7 Hr} : x € 
f[A]} covers all of A. 


The set. {f€ ({x}) : x € f[A]}, forms a pairwise disjoint set of sets which covers all 
of A. So this set partitions A, as claimed. 


We have seen that the partition of a set is the quotient set of some equivalence rela- 
tion, R, (see theorem 8.3). The equivalence relation, R, induced by a partition on a 
set A was defined as follows: 


Two elements of the set A are related under R if and only if they belong to 
the same element of the partition induced by R. 


Then, given a function, f : A — B, on A, we can declare that two elements a and b 
in A are related under a relation Ry if and only if they appear together in f He} 
for some zx in the image, f[A], of A. So the set of subsets of A, {f He} : x € f[A]}, 
is a quotient set of A induced by Ry. We formalize these thoughts in the following 
definition. 


Definition 12.1 Let f : A — B be a function which maps a set A into a set B. We define 
the equivalence relation, Rr, on A induced (or determined) by f as follows: 


‘We remind the reader of the definition of a partition (also found at Definition 8.1). For a set S we say 
that a set of subsets @ C P(S) forms a partition of S if 1) Uac 4 = S, 2) If A and B € Ẹ and AF B, 
then AN B = Ø, 3) A#@ forall AE @. 
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Two elements a and b are related under Ry if and only if {a,b} C f[{x}] for 
some x in im f. The quotient set of A induced by Ry is then 


A/Ry = Pr, = {f Ha} : x € FIA} 


We will refer to A/Ry (or &p,) as the quotient set of A induced (or determined) by f. 


12.2 


We illustrate this in a simple example. Let U = {a, {a}, {a, {a}}, {{a}} } where a is 
a set. Consider the function f : U — U defined as follows: 


a if acr 
{as < {a} if {a}ervanda€ge 
{{a}} if w=a 


We see that f is a well-defined function on the set U. So the set 


€ =F Hall, Fah Fite t 


partitions U in three pairwise disjoint non-empty sets. From this, we can define the 
equivalence relation Rr on U where U/Ry = @. 
We will list the elements in each set: 


F Kaol = { tat, {a {a}}} 
f [taht] = { {lah} } 
F Hahl = tat 


Observe that all elements of U are represented in the three sets above. 


The canonical decomposition of a function. 


Let f : S — T be a function mapping a set S into a set T. We have seen how this 
function, f, determines a new set: the quotient set, S/Ry, induced by f. For each 
x E€ S we let S, = f~[{f(x)}]. Then 


S/R = {Sz : £ E S} 


is the quotient set induced by f. Note that the elements of S/R, are subsets of S. 
We now show how the function f can be expressed as a composition of two other 
functions neither of which is the identity function. 


The figure below illustrates how we will express the function f as a composition of 
two functions. 
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S/R, 


Figure 4: Canonical decomposition of f : S — T 


A. We define the function g, : S —> S/Ry as follows: 


Gy (x) = Sy 
We first verify that g,:S— S /Ry is a well-defined function on S. 


— We first verify that domg, = S: 


Let x € S. Then y = f(x) € f[A] = imf. So Ss = f[{y}]. Thus, 
9, (x) = Sz = fT {y} E€ S/ Ry. 
— Next we show that g, is indeed a function: 


Suppose Sr # Sy. Since {Sz : x E S} partitions S, then 


Sr N Sy = fF KFN STRO = 2 


So x Æ y otherwise we would have f~|{f(x)}] = f[{f(y)}] resulting in a 
contradiction. Then g, is a function as claimed. 


We now verify that the function gy is onto S/Re: 
— Since, for any x € S, Ss = f[{f(x)}] = g;(x), then g; is onto S/R 
B. We define another function h, : S/ Rf — T as follows (Remember that T contains 


the image if f): 
h, (Sz) = f(z) 


We verify that h, : S/Ry — T is a well-defined function on S/Ry: 
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— We first verify that dom h, = S/Ry: 


Let Ss € S/Ry. Since x € S, then f(x) is defined and so h, (Sx) = f(x) is 
defined. 


— We now show h, is indeed a function: 


Suppose f(x), f(y) € h,[S/Ry] and f(x) # f(y). Then Ø = f[{f(x)}}N 
SOKS) = Ss N Sy. So Sz A Sy. Then h, is indeed a function. 


The function hy is one-to-one: 
The function h, is one-to-one on $/Ry, since 


Se # Sy > SyNSy=B 
= fF KF@MHOF FW) = 2 
=> fx) F fy) 


C. Combining the two functions g, : S > S/Ry and h, : S/Rf — T we obtain the 
composition (h,og,) : S — T where 


(hyeog,)(a) > f(x) 


We verify that the function (h,og,) = f: For x € S, 


f(z) = h,(Sz) 
h,(g,(2)) 
= (hog) (2) 


Thus, the two functions h,og, and f agree everywhere on the domain, S, of f. 
We have just proven the following theorem. 


Theorem 12.2 Let f : S — T be an onto function where S$ and T are sets. There exists 
an onto function g, : S — S/Ry and a one-to-one function h, : S/R —> T such that 


hp oGs = f 


The function, h,og, = f, is called the canonical decomposition of f. 


Example: Let U = {a, {a}, {a,{a}},{{a}} } where a is a set. Let the function f : 
U — U be defined as follows: 


a if acr 
f£) = | {a} if {a}cxandag¢g 
{a} if Se 


A. 
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From this we can define the equivalence relation Rf on U where 


Uli aa F- [fap], Fah], FOH 3 
{ {diah tadah h Hoh {a} } 


We can define gp : U +> U/Ry and hp : U/Ry + U as below: 


a ee {{{a}}} = fF [{{a}}] ot {{a}} = f(a) 
{a} = {fa}, {a {a} = fo Hay PE a = f({a}) 
{a,{a}} == {{a}, {a,fa}}} =f Hay PA a = f({a,{a}}) 
{{a}} -S {a} =f KHA] = {a} = f({{a}}) 
We see that 
(hpogs)(a) = f(a) 
(hrogp)({a}) = Faa} 
(hpogr)({a,{a}}) = fla, tat}) 
(hrog) ah = Fala) 


Concepts review: 


1. If f : A — B is a function mapping the set A into the set B, describe a partition of 
the set A induced by the function f. 


2. If f: A — B is a function mapping the set A into the set B, describe and equivalence 
relation Rf on A induced by f. 


3. If f : A — B is a function mapping the set A into the set B, describe the elements of 
the quotient set A/Rş induced by f. 


4. If f : A — B is a function mapping the set A into the set B, what does “the canonical 
decomposition of f” mean? 


5. If f : A — B is a function mapping the set A into the set B, is it always possible to 
“decompose” f as a composition of two functions? How? 


EXERCISES 


1. Let S = {a,b,c}, be a set containing three distinct elements. 
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a) List all the elements of A(S). 
b) We define a function f : P(S) x S — Y(S) as follows: 


ADEL toy it ced 


List all the elements of the function f. 

c) Is f onto A(S)? Explain. 

d) Is f one-to-one on S? Explain. 

e) For every D € P(S), give f(D). 

f) If the function f determines a partition of A(S) x S list the subsets which are 
members of this partition. 


3) Let A be a non-empty subset of the set S and let T = {@,{@}}. We define the 
function f : S — T as follows: f(x) = Ø if x ¢ A and f(x) = {Ø} if x € A. Let Re 
denote the equivalence relation determined by f. 

a) List the elements of the quotient set S/Ry. 
b) If hfogf = f is the canonical decomposition of f, list the elements of the functions 
gf and of hy. 
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Part V 


From sets to numbers 


105 
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13 / The natural numbers. 


13.1 


13.2 


Summary. The main objective in this section is to discuss how the natural num- 
bers, N = {0,1,2,3,...}, are constructed within the Zermelo-Fraenkel axiomatic 
system. We begin by stating the definitions of “successor set” and “inductive 
set”. The natural numbers, N, is then defined as the “smallest” inductive set. A 
ZF-axiom will guarantee the existence of this “smallest inductive set” called the 
“natural numbers”. We then show how the Principle of mathematical induction 
is an immediate consequence of this definition of N. We define “transitive sets” 
as sets, A, whose elements are subsets of A. The elements of N are then shown 
to be “transitive sets”. We then prove a few properties possessed by all natural 
numbers. 


Preliminary discussion. 


We now have enough background material to appropriately define the set commonly 
known as the natural numbers. We have an intuitive understanding of what the num- 
bers 0,1,2,3,..., mean and so we will let our intuition guide us in our attempt to 
define N within our set-theoretic axiomatic system. 


When defining N what are our options? If we want N = {0,1,2,3,..., } to be defined 
within the ZFC-axiomatic system then we don’t have much choice in the matter: The 
elements of N must be sets. But they are sets whose elements have certain character- 
istics. These characteristics are such that nobody would confuse N with the real or 
complex numbers, for example. A way of approaching this question is to ask ourselves 
what properties of the natural numbers allow us to say with confidence that 1/3 or 
V2 are not natural numbers. One obvious property of N is that it is an infinite set. 
We would necessarily have to define what an “infinite set” is. We then would have 
to think deeply about the fundamental characteristics of N and its elements. For 
example, any set which represents a non-zero natural number must have an immedi- 
ate predecessor and an immediate successor. We must ask ourselves, “what kind of 
set can have an immediate predecessor and an immediate successor?”. Furthermore, 
we will eventually have to determine how arithmetic operations can be performed on 
such sets. Determining the natural numbers’ intrinsic properties which allow us to 
distinguish them from other types of numbers must thus be our starting point. 


Constructing the natural numbers. 


As we reflect on sets which would suitably represent natural numbers we come to 
realize how very few sets we have actually witnessed up to now in our study of set 
theory. What kind of sets have we encountered? 


1) First, we gave ourselves an axiom (Axiom of class construction) which guarantees 
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that {x : x # x} is a well-defined class. We decided to represent this class by 
“@” and call it the “empty class”. 


2) Then we gave ourselves an axiom (Axiom of subset) that says that “if S is a set, 
then any subclass of S must also be a set”. We gave ourselves an axiom that 
guaranteed that there exists at least one set (The Axiom of infinity states that 
“there exists a non-empty class A called a set such that...”). Once we showed 
that @ C S, for any set, then @ was our first explicitly constructed set. 


3) We then gave ourselves set constructing tools. The Axiom of pair allows us to 
say, for example, that {9}, {2,{@}}, {{9, {@}}},... are sets. The Axiom of 
union allows to gather together all the elements from a “set of sets” to form a 
larger set. The Axiom of power set allows us to construct a set whose elements 
are the subsets of a set. 


From this we see that nearly all the sets we explicitly constructed up to now have 
evolved from successive applications of the axioms of pair, union and power set, with 
the empty set as a starting point. If we explicitly list the elements of these sets we 
will see repetitive sequences of “curly brackets” “{” and “}” and the symbol “@”. We 
then expect every natural number to be a set of this nature. 


If we are asked to define the set of all natural numbers as succinctly as possible we 
may consider the following definition as a reasonable one: 


The set, N, of all natural numbers is the intersection of all sets S which 
satisfy the two properties, 0 € S and [n € S| > [n+1 € S]. 


Given the knowledge and the experience we have with natural numbers, it would be 
difficult to imagine a natural number which does not belong to such a set. It also 
seems obvious that numbers such as 3 and y5 cannot belong to such a set. This 
will be our model for formulating a set-theoretic definition of the natural numbers. 
It seems natural to define, 0 = @, as being the smallest of all natural numbers. The 
challenge is to define the operation “+ 1” using the language of sets. We can view 
“+ 1” as an “immediate successor constructing mechanism”. We begin with the fol- 


lowing definition. 


Definition 13.1 For any set x, we define the successor, xt, of x as 


gt =xU {a} 


We see that this is an operation which adds a single element to a previously known 
set. For example, if A = {a,b,c}, then the successor of A is 


At = {a,b,c} U {{a, b, c}} = {a, b,c, fa, b, c}}+ 


Note that if x is a set, then the expression x” 


the elements of x are. 


= z U {x} cannot be simplified since we do not see what 
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This is a set constructing mechanism. We need only one set to initiate a non-ending 
process. Given any set, we can construct a successor. By the axiom of union (A7) 
a successor is always a set, provided the class which initiates the process is a set. 
Starting with the empty set @, we obtain 





Bo = Ø 
B, = Bt =øU{Ø} = {Ø} 
By = (Ø*)* = {O}* = {Ø} U {{Ø}} = {2, {o}} 
Rather than use the symbols, {Bo, B1, B2,...,}, why not use conventional natural 
number notation: 
0 = © 
1 = 0t=@ot =@U{o}= {eo} = {0} 
2 = 17 ={o}* ={o}U{{o}} = {2, {2}} = {0,1} 
3 = 2 ={9,{o}}* = {Ø, {o}} U {{a, {3H} = {2, {2}, {2, {o}}} = {0, 1, 2} 
We can thus define, one at a time, each symbol 0,1, 2,3,..., as a set. Let’s continue 


for a bit and see what happens. We will define: 


4 = 3° ={2, {Ø}, {P, {Sh}, (2, {2}, {S, {S}}} = {0, 1, 2, 3} 
5 4° = {0, 1, 2,3, 4} 

6 = 5° = {0,1,2,3,4,5} 

7 6" 





" = {0, 1, 2,3, 4,5, 6} 


This method for constructing natural numbers is worth exploring. 


We further examine the properties of a set constructed in this way. 


— As mentioned before, construction starts with the set 0 = @ moving upwards. 

— For each of the elements of the set, 7 = {0,1,2,3,4,5,6}, we see that the property 
“n C n*” is satisfied. And surprisingly enough, n € n+. That is, each n is both a 
subset and an element of its successor n*. For convenience, we define the ordering 
relation “n < m” to mean “n C m” sothat0O<1<2<3<4<5<6< 7, 
for example. Witness 3 = 2* = {0,1,2} = {0, 1, {0,1}}, so 2 = {0,1} C 3; hence, 
2<3. 

— For each element, n, of the set 7 = {0,1,2,3,4,5,6}, we see that n is precisely the 
set of all “natural numbers” strictly less than itself. 

— Also, if we set 0 = Ø 


1={o}={0} so OE€land0cl 

2 = {0,1} = {0,{0}} so 1€2andic2 

3 = {0,1,2} = {0, {0}, {0,{0}}} so 2€3and2c3 
le3and1c3 
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The set, 7 = {0,1,2,3,4,5,6}, satisfies the property: Every pair of elements con- 
tained in the number 7 are comparable with respect to < and they satisfy the 
property: 
(n< m) => [|(nE€ m) & (nc m)] 
We want to generalize to all of N the properties we have just witnessed for the numbers 


0, 1, 2,..., 7. A first draft of a definition of a natural number may look something 
like this: 


We say that n is a natural number if it is either the empty set @ or equal 
to m U {m} for some other natural number m. 


Then we would have to prove that “for any pair m,n of distinct natural number, 
(m € n) & (m Cn)”. From this we would conclude that both € and C are strict 
linear orderings of the set of all natural numbers. 


Based on such a definition, and the few examples we have seen above, we could 
conclude that all the elements of 7 = {0, 1, 2, 3, 4,5,6} are natural numbers. Further- 
more, we could show that 7 is a natural number. We can then construct 8 = 7+ = 
{0, 1, 2,3,4,5,6, 7} and see that the elements of 8, as well as 8 itself, are all natural 
numbers. 


Since the expression, n U {n}, is at the core of everything we have seen above, we 
formally provide some vocabulary to discuss such a concept. 


Definition 13.2 If x is a set, then we define 
gt =xU{a} 
A set, A, is called an inductive set | if it satisfies the following two properties: 
a) Ø EA. 


b) re ASatead. 


“Inductive set” as defined above nicely describes, in a nutshell, the class N of natural 
numbers. But we require an axiom which guarantees that at least one induction set 
exists. This is done with the Aziom of infinity (A8). 


Axiom (The axiom of infinity): An inductive set exists. 


This axiom of infinity provides us with the minimum amount of raw material to do 
finite mathematics. 

Now that we have given ourselves at least one inductive set, we will define the natural 
numbers as being the smallest one. 


1The term successor set is also used instead of inductive set. 
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Definition 13.3 We define the set, N, of all natural numbers as the intersection of all 
inductive sets. That is, 


13.3 


N= {x: a € I for all inductive set I} 


To say that N is the intersection of all inductive sets is another way of describing the 
set of all natural numbers. Is the set N itself inductive? We verify this: By defini- 
tion, all induction sets contain the element Ø and so Ø belongs to their intersection, 
N. Condition one is satisfied. We verify condition two: If x € N, then x belongs to 
all inductive sets and so xt must belong to all inductive sets; so xt € N. So N is 
an inductive set. It immediately follows that if n is any natural number, then so is 
its successor, n*. We will now verify that N, thus defined, actually satisfies all the 
properties that are expected from it. 


Mathematical induction. 


The cleverly chosen four words “An inductive set exists” will allow us to prove that 
the smallest inductive set provides a precise set-theoretic representation of the set 
of all natural numbers as we know it. This inductive set possesses all the essential 
properties of the natural numbers, including its linear ordering structure. As we shall 
soon see, it will allow us to define on it the common arithmetic operations we nor- 
mally perform on natural numbers. Proving that this inductive set possesses all the 
essential properties of the natural numbers will require the well known mathematical 
tool called the Principle of mathematical induction. This principle is “hardwired” 
within the definition of “inductive set”. 


Theorem 13.4 Let A be a subset of N. If A satisfies the two properties: 


a) OEA 


b) mE ASmtedA 


then A=N. 


Proof: 


By hypothesis, A is an inductive set since it satisfies the two required properties. Since 
N is the intersection of all inductive sets, then N C A. By hypothesis, A C N. Thus, 
A=N. 
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Corollary 13.5 (The Principle of mathematical induction.) Let P denote a particular set 
property. Suppose P(n) means “the property P is satisfied depending on the value of the 
natural number n”. If 


a) P(0) holds true, 
b) P(n) holds true = P(n*) holds true. 


then P(n) holds true for all natural numbers n. 


Proof: Let 
A = {n € N: P(n) holds true } 


Part a) of the hypothesis states that “0 € A”, while part b) states that [n € A] > [nt € 
A]. Then A is an inductive set and so A = N (by the theorem). So P(n) is true for all 
natural numbers n. 


A few remarks. The proofs above illustrate how the Principle of mathematical in- 
duction is intrinsically linked to the definition of the natural numbers. The set of all 
natural numbers is the only set whose existence is essentially postulated. The other 
explicitly defined set is the empty class which was shown to be a set (as a consequence 
of the Axiom of construction followed by the Axiom of subset). 


Some readers may not be familiar with “proofs by mathematical induction”. These 
readers will benefit from the many examples of proofs by induction in this section. 
It is well worth taking a few moments to summarize the main steps to be followed 
when proving a statement by induction. Induction is used when we are dealing with 
some property P(n) which is a function of the natural numbers. Let S = {n € 
N : P(n) holds true}. Now this set, S, may possibly be empty, may contain a few 
elements of N or may even contain all of its elements. The objective is to show that if 
two specific conditions are satisfied, then S = N. That is, we want to prove that P(n) 
holds true for all values of n. For example, suppose P(n) is described as the property 


n(n + 1) 


Le Sa y 


We want to prove that this holds true no matter what natural number n we use. We 
highlight the main steps. 


Step 1: Write down explicitly the property which is a function of n as illustrated above. 


Step 2: Prove the “Base case”. This means that we must prove that P(0) is true. 
In our example, we are required to show that 0 = oo) = 0. We see that the base 
case holds true. If the property cannot be shown to be true for the base case, then 
P(n) does not hold true for all n. It sometimes helps us to understand what is going 
on if we prove that both P(0) and P(1) are true (especially when the base case is 
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13.4 


“vacuously true” ). 


Step 3: State the “Inductive hypothesis”. In this step we suppose that the P(n) 
is true for some unspecified natural number n. That this property holds true for a 
particular n is now considered to be “given”. 


Step 4: With the help of the assumption that P(n) is true, prove that P(n*) (equiv- 
alently P(n + 1)) is true. In our example we would write something like this: 


























1424--¢n= Ret) 1+24+---+n m+- tD imn 
1424+---4+n (nti) = Vea 


Step 5: Write down the conclusion: Since “P(n) is true” implies that “P(n + 1) is 
true”, then, by the principle of mathematical induction, P(n) holds true for all n. 


Difficulties encountered when applying this procedure are often due to skipped steps. 


Transitive sets. 


The few examples of natural numbers constructed above have illustrated an interest- 
ing property: Each element of a natural number, n, is seen to be subset of n. We 
formally define the words used to express sets which satisfy this property. We then 
provide in the form of a theorem a useful characterization of such sets. 


Definition 13.6 A set, S, which satisfies the property “x € S => x C S” is called a tran- 
sitive set. 


Theorem 13.7 The non-empty set, S, is a transitive set if and only if the property 


|x € y and y € S] => [z € S] 


holds true. 


Proof: 


(=) What we are given: That S is transitive, x € y and y € S. 
What we are required to show: That x € S. 
Since S' is a transitive set y C S. Then x € y C S implies x € S. 


(<) We are given that “(x € y and y € S) > (x € S)” and z€ S. 

We are required to show that “z CS”. 

If z = Ø, then z C S and we are done. Suppose that z 4 Ø. Let a € z. By hypothesis, 
a € S. Since a € z implies a € S, then z C S. 
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The above characterization (x € y and y € S) > x € S relates more closely to our idea of 
transitivity (that is, a<bandb<c>a<c). 


The next theorem shows that N is a transitive set. 


Theorem 13.8 The set N of natural numbers is a transitive set. 
Proof: 


By the characterization of transitive sets stated above, it suffices to show that for each 
n EN, renszeEN. We will prove this by mathematical induction. Let P(n) denote 
the statement “(xe nE N) >xzeN”. 


— Base case: The statement, “x € 0 = Ø” = “x € N”, is true since there are no elements 
in 0 = Ø. So P(0) holds true. 


— Inductive hypothesis: Suppose the statement P(n) holds true for the natural number 
n. We are required to show that P(n*) holds true. Suppose y € nt = nU {n}. Then 
either y € n or y € {n}. If y E€ n, then by the inductive hypothesis, y € N. If y € {n}, 
then y=n €N. 


By mathematical induction, the statement holds true for all elements of N and so, by 
definition, N is a transitive set. 


The following theorem first establishes that no natural number is an element of itself. 
It also shows that for distinct natural numbers one is an element of the other if and 
only if it is a proper subset of the other. This of course implies that every natural 
number is a transitive set. 


Theorem 13.9 


a) For natural numbers n,m, m E€ n = m C n. Hence, every natural number is a 
transitive set. 


b) For any natural number n, n Æ nt. 


c) For any natural number n, n g n. 


d) For any distinct natural numbers n,m,m Cn>men. ! 


'The reader is cautioned not to misread the statement “m C n > m € n”. It does not say that any 
subset of a natural number n is an element of n. It says that “any natural number which is a subset of n is 
an element of n”. 
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Proof: 


a) This is a proof by mathematical induction. Let P(n) be the property “m and n are 


b) 


natural numbers and m E€ n > m C n”. We are required to prove that the set 

{n € N: P(n) is true } =N. 

— Base case: We claim that P(0) holds true. Recall that 0 = Ø. Suppose P(@) is 
false. Then there must be some x € Ø such that x æ. This is absurd since Ø 
does not contain any elements. So P(0) holds true. 

— Inductive hypothesis: Suppose that for some natural number n, P(n) holds true. 
We are required to show that P(n*) holds true. 

* To show that P(n*) is true, suppose m € nt = nU{n}. We are required to show 
that m Cnt =nU {n}. 
Case 1: If m € n then by the inductive hypothesis, P(n) is true, and so m C n. 
Then m C n* = nU {n} and so P(n*) is true. 
Case 2: Suppose m ¢ n. 


mgnandment =nU{n} > me {n} 


> m=n 


Clearly m =n C nU {n}. Then P(n*) is true. 
We have shown that if P(n) is true, then P(n*) is true. By mathematical induction 
P(n) is true for alln € N. We conclude that every natural number is a transitive set. 
We prove that n Æ n* by induction. Let P(n) denote the statement “n Æ n U {n}”. 
Base case: Since Ø = { } # {Ø}, P(0) holds true. 


Inductive hypothesis: Suppose n 4 n U {n} for some natural number n. 
We are required to show that n U {n} Æ nU {n} U {nU {n}}. 


Now 


nU{n}U{nU{n}}=nU{n} > nU{n} env {n} 
=> nU{n}enornU{nh=n 


The inductive hypothesis does not allow “n U {n} = n”. So nU {n} € n. By part 
a), nU{n} C n. Since n C nU {n}, then n = n U {n} (Axiom of extent) again 
contradicting the inductive hypothesis. Then n U {n} U {n U {n}} # nU {n}. So 
“P(n) = P(n*)” holds true. 

By mathematical induction, n 4 n U {n} for all natural numbers. 

Suppose n is a natural number such that n € n. Then nU{n} C n. Since n C nU{n}, 
n =n U {n} contradicting the statement of part b). We must conclude that n ¢ n. 
We are required to show that for all m,n €N, m Cn =m €n. We will prove this 
by mathematical induction on n. Let P(n) be the property “[m is a natural number 
and m C n] > [m E n]”. 
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— Base cases n = Ø or 1: For n = Ø, the statement m C Ø > m €E Ø is vacuously 
true. For n = 1, Ø C 1 = {Ø} and Ø € 1 = {Ø} hold true. So both base cases P(0) 
and P(1) hold true. (Actually showing P(0) holds true is sufficient.) 

— Inductive hypothesis: Suppose n is a natural number such that P(n) holds true; 
that is, for any natural number m, “m C n > m €E n”. We are required to show 
that for any natural number m, “m C nt > ment”. 

* Let m be a natural number such that “m C nt =nU{n}”. 
Case 1: If n g m, then m C n. By the inductive hypothesis, m € n C nU {n}. 


Hence, m € nU {n}. 


Case 2: Suppose n € m. By part b), m Æ n. Since m and n are distinct natural 
numbers, by part a), n C m. Then nU {n} C m. Since m C n U {n}, then 
nU {n} C nU {n}, a contradiction. 

So only case 1 applies. So P(n*) holds true. 


By the principle of mathematical induction m C n > m € n for all natural numbers 


m and n. 


To illustrate how the elements of N satisfy the property “x € n => x C n” consider, 
for example, the natural number 4, 


4={Ø, {Ø}, 19,1}, 13, {12}, {2,{2}}} 


We see that 


0O=GE4 

1={o}e4 

2 ={Ø,{Ø}} E4 

3 = {2, {3}, {2, {o}}} E4 


and 
and 
and 


and 


0O=o0c4 

1= {Ø} C 4 Since every element of {Ø} belongs to 4. 

2 = {Ø,{Ø}} C4 Since every element of {Ø, {}} belongs to 4. 
{2 {2} {2 {ØH} C4 


Since every element of {Ø, {Ø}, {@,{@}}} belongs to 4. 


13.5 Other basic properties of natural numbers. 


We prove a few more properties of the set, N, as defined above. 


Theorem 13.10 Let m and n be distinct natural numbers. 


a) Ifm Cn, then m+ Cn. 


b) Let m and n be any pair of distinct natural numbers. Then either m C n or n C m. 
Equivalently, m € n or n € m. Hence, both “C” and “eE” are strict linear orderings 


of N. 


c) There is no natural number m such that n C mC nt. 


Proof: 
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a) 


13.6 


What we are given: That m and n are distinct natural numbers where m C n. What 
we are required to show: That mU{m} Cn. 

Since m C n, then m € n (by theorem 13.9). If n = mU {m}, then we are done since 
mU{m} Cn. Suppose n 4 mU{m}. Then “m C n” and “m E€ n” together imply 
that mU {m} C n. Hence, mt = mU {m} C n, as required. 


What we are given: That m and n are distinct natural numbers. What we are re- 
quired to prove: That m Cn orn Cm. 


We will prove this by mathematical induction on n. Let P(n) be the statement “for 
every natural number m Æ n, either m C n or n C m” 


— Base cases n = Ø or 1: For n = Ø, the statement @ C m holds true for all non-zero 
natural numbers m. For n = 1 and m = 1, Ø C 1 = {Ø}. Suppose m is a natural 
number other than 0 and 1. Then Ø C m > Ø+ = {Ø} C m (by the theorem 13.9 
above). Since m Æ 1, then {Ø} C m holds true for any such m (since Ø € m for 
every non-zero natural number m). 

— Inductive hypothesis: Suppose P(n) holds true for some natural number n. That 
is, suppose n is a natural number such that for any natural number m not equal 
to n, either m C n or n C m. We are required to show that P(n*) holds true. 


Let m be a natural number such that m 4 nt. Case 1: If m = n, then m € nU {n} 
and so m C nU {n} (by theorem 13.9) and we are done. Case 2: Suppose m # n. 
Then, by the inductive hypothesis, either m C norn C m. If m C n, then 
m C n™ =nU {n}. Done. If n C m, then n* C m (by part a)). Since m Æ n*, 
n* C m. Then P(n*) holds true. 


By the principle of mathematical induction, for any pair of distinct natural numbers 
m and n, either m C n orn C m. Since m C n if and only if m € n, this property 
also holds true with respect to the relation “e”. 


What we are given: That n and m are distinct natural numbers. What we are re- 
quired to prove: That n C m C n* is impossible. 


Suppose n C m C nt. Then m € nU {n}. Since m Æ n, then m € n which means 


m C n. This contradicts our hypothesis, n C m. We have shown that n C m C n* is 
impossible, as required. 


The immediate predecessor of a natural number. 


We have seen that the elements, n, of the natural numbers are equipped with an “im- 
mediate successor” constructing algorithm, nt = n U {n}, where n C nt = nU {n} 
and no other natural number m sits between n an n*. It is normal to ask if every 
non-zero natural number has an “immediate predecessor”. That is, given an arbitrary 
natural number, n, are we guaranteed that there exists a natural number, k, such 
that kU {k} = k*t =n. Part c) of the theorem above guarantees that there can be no 
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natural number between k and n, and so such a k would be the immediate predecessor 
of n. If such a k exists, is there a way to construct this predecessor k of n just as 
we were able to construct an immediate successor of a natural number? The next 
theorem shows how we can construct the immediate predecessor of a natural number. 


Theorem 13.11 If m and n are natural numbers such that mt = n, then m is called an 
immediate predecessor of n. For any non-zero natural number n, k = U{m €N:mc n} is 
a natural number which is an immediate predecessor of n. 


Proof: 


What we are given. That n is a non-zero natural number. 

What we are required to show: That k = U{m € N: m C n} is a natural number and 

kt =n. 

Proof by induction. Let P(n) be the statement “k = U{m € N : m C n} is a natural 

number and kt = n”. 

— Base cases n = 1 or 2: If n = 1, then k = Ume Mm = Ø a natural number such that 
kt = @U{@} = {Ø} = 1 =n. Ifn = 2, then k = Ume M = GUl = ØU{Ø} = {Ø} = 1 
a natural number such that kt = {Ø} U {{o}} = {9, {Ø} =2 =n. 

— Induction hypothesis: Suppose P(n) holds true. That is, suppose n is a natural number 
for which [J m)” =n. To show that P(n*) 


holds true, it suffices to show that Umen+ M is a natural number and (Cees m)~ = 
nt. Let k= Ument M. See that 


m is a natural number satisfying (U 


men men 


k = UmeEN: menu {n}} 
= UimeN:menorm=n} 
= UimeN:mentun 


By the induction hypothesis, U{m € N : m € n} is a natural number which is an 

immediate predecessor of n. Then it must be a proper subset of n. It follows that 

U{m EN: men}Un=n. Then k =n which implies kt = n*. So P(n*) holds true. 
By mathematical induction, for every natural number n, U{m € N:m C n} is a natural 
number and (U{m EN:mCn})t =n. 


Theorem 13.12 Unique immediate predecessors. Any non-zero natural number has a 
unique immediate predecessor. 


Proof: 


We prove this by induction. For non-zero natural numbers n let P(n) be the statement “ 
the natural number n has a unique immediate predecessor” . 
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— Base case n = 1: By definition, 1 = {Ø} = Ø U {Ø} = @t = 0+. So P(1) holds true. 

— Induction hypothesis: Suppose n is a non-zero natural number such that P(n) holds 
true. That is, there is only one natural number m such that m+ = n. We are required 
to show that nt has a unique predecessor. Trivially, n is one immediate predecessor of 
n*. Suppose k is another natural number such that kt = nt. Then nU{n} = kU{k}. 
We claim that n = k. Suppose not. Then both n € k and k € n must hold true. We 
have shown that every natural number is a transitive set (see theorem 13.9 a)). By 
theorem 13.7, n € k and k € n implies n € n. By 13.9 part c) this cannot be true for 
any natural number, and so we have a contradiction. Then n = k as claimed. Hence, 
n* has as unique immediate predecessor, n. So P(n*) holds true. 


By mathematical induction, every non-zero natural number has a unique immediate pre- 
decessor. 


13.7 The second version of the Principle of mathematical induction. 


The following theorem is a variation of the Principle of mathematical induction. It 
may sometimes be more efficient to apply this version when proving certain theorem 
statements. Although we will not present an application of this version now, we will 
soon see some proofs in which this version is easier to apply. 


Theorem 13.13 (The Principle of mathematical induction: second version.) Suppose P(n) 
is a property whose truth value depends on the natural number n. Suppose that for any 
natural number n, 

[P(k) is true for all k < n] > [P(n) is true] 


Then P(n) holds true for all natural numbers n.1 


Proof: 


Given hypothesis: [P(k) is true for all k < n| = [P(n) is true] 

What we are required to show: That P(n) holds true for all natural numbers n. 

Let P*(n) denote the statement “P(k) is true for all k < n”. We will show by induction 

(original version) that P*(n) holds true for all n. From this we will conclude that P(n) 

holds true for all n. 

— Base case: Since P(k) vacuously holds true for all k € @, then by the given hypothesis, 
P(0) = P(@) holds true. So the base case P*(0) is satisfied. 

— Inductive hypothesis: Suppose P*(n) holds true for some natural number n. This 
means “P(k) is true for all k < n”. By the given hypothesis, P(n) must hold true. 
Then “P(k) is true for all k < n*”. That is, P*(n*) holds true. 


1Note that k < n and k € n are equivalent expressions. 
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By mathematical induction, P*(n) holds true for all natural numbers n. 


Let m be any natural number. Then, by what we have just shown, P*(m™) holds true. 
That is, “P(k) is true for all k < mt”. So P(m) is true. The statement is thus proved. 


13.8 A few words about the Peano axioms. 


There is a particular set of axioms which serves as a foundation for all mathematical 
statements related to the natural numbers. These axioms are not set-theoretic and 
slightly predate the ZFC-axioms. In 1889, the Italian mathematician Giuseppe Peano 
proposed a set of 9 mathematical statements from which evolve all mathematical 
statements relating to the natural numbers. Today these are referred to as the Peano 
axioms (the Italian name “Peano” is pronounced as ‘pay-ah-no’). We will see that 
each of these axioms belongs to ZFC set-theoretic universe and so as a group play the 
role of intermediary — certainly a more easily understandable one — between the Set 
theory axioms and the body of mathematics we refer to as number theory. We will 
list the 9 Peano axioms below. The symbol “0” is an undefined symbol. The symbol 
“S” represents a single valued function we refer to as the “successor function” on the 
natural numbers. 


P1 The symbol 0 is a natural number. 


Peano axioms on equality 
P2 Every natural number is equal to itself. That is, equality “=” is a reflexive binary 
relation on N. 


P3 If n and m are natural numbers such that n = m then m = n. That is, equality 
is a symmetric binary relation on N. 


P4 If n, m and k are natural numbers such that n = m and m = k then n = k. 
That is, equality is a transitive binary relation on N. 


P5 If n is a natural number and “a = n” then a is a natural number. 
Properties involving the successor function, S. 
P6 If n is a natural number then so is the image, S(n), of n under S. We refer to 


“S(n)” as a successor of n. 


P7 The natural numbers n and m are equal if and only if S(n) = S(m). Hence, 
every natural number has precisely one successor and a natural number is the 
successor of, at most, one natural number. 


P8 For any natural number n, S(n) 4 0. 


Mathematical induction 
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P9 If M is a set which contains the natural number 0 and whenever n is a nat- 
ural number then so is S(n) then M contains all natural numbers. That is 


{0, S(O), S(S(0)), S(S(S(0))), ---,} E M 


We verify that each of these 9 statements follows from ZFC-axioms. The empty set, 
Ø, is easily perceived as being the natural number 0. Equality of sets is reflexive in 
ZFC (this follows just about immediately from axiom A1) and so this must hold true 
for those sets in ZFC we call the natural numbers. Symmetry and transitivity of “=” 
on sets automatically applies to those sets we call the natural numbers. So P3 and 
P4 also belong to ZFC. Equal sets contain the same elements and so P5 holds true in 
ZFC (See theorem 2.3 c) ). By the axiom of pair and union, for any natural number 
n, nU {n} = S(n) is a set. So P6, P7 and P8 easily follow from this definition of the 
“successor of n”. The Mathematical induction statement, P9, follows from the Axiom 
of infinity. Note that the Axiom of power set, the Axiom of replacement, the Axiom 
of regularity and the Axiom of choice are not required to do mathematics with the 
natural numbers. 


Concepts review: 


1. 


2. 


10. 


11. 


12. 


13. 


If x is a set then what is its successor? 


What is an inductive set? 


. What does the Axiom of infinity state? 

. How is the set of natural numbers defined? 

. List the first four natural numbers using set notation. 
. What is the Principle of mathematical induction? 

. What is a transitive set? 

. If n is a natural number, is n a transitive set? 


. What is the difference between an inductive set and a transitive set? 


Is N a transitive set? 
Give a characterization of transitive sets. 
Is it true that any element of a natural number is a natural number? Why? 


Can a natural number be an element of itself? 


122 


14. 
15. 
16. 
17. 
18. 


19. 


20. 
21. 


22. 
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Is N is a natural number? Why? 

If n is a natural number how many successors can n have? 

What is a second version of the Principle of mathematical induction? 

If n is a natural number what does it mean to say that m is its predecessor? 
Give an expression which describes the predecessor of a natural number n. 


If m and n are natural numbers such that m € n can it happen that m* = n? Can 
it happen that n € mt? 


If m is a subset of the natural number n is it possible that m € n? In which case? 
Are there any natural numbers which are inductive sets? 


For three natural numbers m, n and t satisfying m € n and n € t does it always follow 
that m € t? 


EXERCISES 


1. Let m and n be two natural numbers. Prove that if m € n, then m Æ n™. 
. Show that if n is a natural number, then nt Æ 0. 
. Write down the natural number 5 using only left and right brackets, commas and 


the symbol “g”. 


4. If n is a natural number is n and inductive set? Justify your answer. 
. Is the set of all natural numbers a natural number? Justify your answer. 


6. Is NU{N} a natural number? Explain why or why not. 


7. Suppose n is a non-zero natural number. 


a) Is A(n) a natural number? Why? 
b) Does A(n) contain a natural number? Which one? 
c) List all elements of A(3). 


. Consider the class Y(N). 


a) Is A(N) a set? Why? 
b) Does A(N) contain any natural numbers? Explain. 


c) Does P(N) contain elements which are not natural numbers? If so, list at least 
three. 


d) Is A(N) a natural number? Why? 


Part V: From sets to numbers 123 


11. 
12. 


13. 


14. 


. Is NU {N} a transitive set? If so prove it. If not say why. 
10. 


Is NU {N} an inductive set? If so prove it. If not say why. 


Show that finite unions and finite intersections of transitive sets are transitive sets. 


Suppose S C N. Suppose that the union of all elements of S is S. Prove that S 
cannot be a natural number. 

Jo-Anne has defined the natural numbers in the ZFC-axiomatic system as follows. 
She defined an inductive setas “S is inductive if, whenever x € S, then {x} € S”. By 
first invoking the axiom of infinity she defines the natural numbers N as the smallest 
inductive set linearly ordered by “€”. She defines 0 = Ø, 1 = {Ø}, 2 = {{1Ø}}, 
3 = {{{o}}}, 4 = {{{{o}}}} and so on. We see that 0E€1€2€36€4---. Will 
this work as a definition of the natural numbers? If so, say why. If not, explain why. 
Show that N = U{n: n € N}. 
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14 / The natural numbers as a well-ordered set. 


Summary. In this section we introduce the notion of “well-ordered set”. We 
show that the set of all natural numbers, when equipped with the membership 
ordering relation “€”, is a well-ordered set. We then define “bounded set” and 
“the maximal element of a set”. Finally we show that bounded subsets of N 
must contain a maximal element. We then use N to construct various other 
sets, some of which are also well-ordered. 


14.1 Order relations on N. 


We have seen that the definition of the natural numbers within the ZFC-axiomatic 
system leads to two equivalent order relations on N. Both “Cc” and “c” have been 
shown to be equivalent strict linear orderings of N, in the sense that n C m if and 
only if nm € m. 


We can naturally extend the strict order relation “C” to the non-strict order relation “C” 
while maintaining the linearity property. That is, m C n if either m C n or m= n. 
We can similarly extend the relation “€” by introducing the following notation. 


Notation 14.1 We define the relation “€” on N as follows: 
m E- nif and only ifm=normen 


If m €_ n and we want to state explicitly that m Æ n we write mEn. 


14.2 A well-ordering of N. 


There is an important property that is not possessed by all linearly ordered classes. 
It is called the well-ordering property. We formally define this property. We will then 
prove that (N, €) is a well-ordered set. 


Definition 14.2 Let (S, <) be a linearly ordered set. Suppose T C S. 


a) We say that the element q is “a least element of T with respect to <” if and only if 
qg€Tandq<mforallmeT. 


b) If S is equipped with a strict linear ordering “<” we say that q is a least element of 
T with respect to < if and only if q E€ T and q < m for all m € T where m £ q. 
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c) The set (S, <) is said to be well-ordered with respect to “<” if every non-empty subset 
T of S contains its least element with respect to <. Similarly, the set (S, <) is said 
to be well-ordered with respect to “<” if every non-empty subset T of S contains its 
least element with respect to <. 


We show that € well-orders the set N. 


Theorem 14.3 The set N of all natural numbers is a strict €-well-ordered set. 
Proof: 


Given: The relation “€” strictly linearly orders N; the set A is a non-empty subset of N. 
Required to show: That A contains a least element with respect to €. 

Proof by contradiction: Suppose A does not contain a least element. We claim that A 
must then be empty, thus contradicting our hypothesis. 


— Proof of the claim: We invoke the second version of the Principle of mathematical 

induction. For each natural number k, let P(k) denote the statement “k ¢ A”. 
Induction hypothesis: Let n be some natural number such that P(k) = “k ¢ A” 
holds true for all k € n. 
Suppose n € A. Then, for all a € A, n € a (for if a € n, then, by the induction 
hypothesis, P(a) = “a ¢ A” holds true). This means that n is a least element of 
A with respect to €. This contradicts “A contains no least element with respect to 
€”. Then n ¢ A. Then, P(n) = “n ¢ A” holds true. By the second version of the 
principle of mathematical induction, P(k) = “k ¢ A” holds true for all k € N. Then 
A contains no elements, as claimed. 


This contradicts the fact that A is non-empty. The source of this contradiction is our 
assumption that A does not contain a least element. We must conclude that every subset 
of N has a least element with respect to “Ee”. 


We have previously shown that the second version of the Principle of mathematical 
induction follows from the first version or the Principle of mathematical induction. 
We can show that if we only assume that N is €-well-ordered and the second version 
of the induction principle, then the first version of the induction principle holds true. 
The proof is as follows. 


What we are given: That N is €-well-ordered and that the second version of the Prin- 
ciple of mathematical induction holds true. 


What we are required to show: That the first version of the Principle of mathematical 
induction must hold true. 
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Let P(n) be a property whose truth value depends on the natural number n. Suppose 
P(0) is known to be true. Also suppose that if P(n) holds true, then so does P(n*). 
Let A = {k € N: P(k) is false }. Then N — A contains 0 and, whenever n € N — A, 
then nt EN- A. 

We claim that A must be empty (hence, P(k) holds true for all k € N). 


Proof of claim: Suppose A is non-empty. Then, since N is €-well-ordered, A has 
a least element, say s = m*. Then P(k) holds true for all k € s = m*. Then 
m € N— A. By hypothesis, m* € N— A. This contradicts the fact that mt is 
the least element of A. The source of the contradiction is the assumption that A is 
non-empty. Then A must be empty, as claimed. 


We conclude that the set A is empty and so P(k) holds true for all natural numbers k. 
We have thus shown that the first version of the Principle of mathematical induction 
on N holds true. 


Corollary 14.4 Every natural number n is a €-well-ordered set. 


Proof: 


Let n = {0,1,2,...,n—1} be a natural number. We already know that the natural numbers 
are €-linearly ordered. Let U be a non-empty subset of n. Then U is a non-empty subset of 
N. When viewed as a subset of the €-well-ordered set N, the set U contains a least natural 
number, say k. Then k €- m for allm € U. So when U is viewed as a subset of n, k is the 
least element of U. So n is €-well-ordered. 


14.3 


We have thus shown that not only is N a well-ordered set, but so is every single natural 
number. 


Bounded subsets of N 


The reader may be familiar with the concept of bounded subsets. In the context of 
a linearly ordered set (S,<), we say that a subset A of S is bounded above, or has 
an upper bound if there exists some element M € S we call an “upper bound of A” 
such that « < M for all x € A. A subset can have many upper bounds. Similarly the 
subset A is “bounded below” if there exists an element m we call a “lower bound of 
A” such that m < x for all x € A. For example, every non-empty subset of (N, €) is 
bounded below by 0. 


Suppose A is a non-empty subset of a linearly ordered set (S, <) which contains an 
upper bound M of A. Then, since A is linearly ordered, for every element x € A, 
x < M. Furthermore, M is the only upper bound of A which is contained in A, for if 
M* is another upper bound of A which is contained in A, then M* < M; if M* < M, 
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then M* is not an upper bound of A; so M* = M. In this case, we can refer to M 
as being the maximal element of A or the maximum element of A. This corresponds 
to the definition we have previously provided for the words “maximal element” and 
“maximum element” of an ordered set. For linearly ordered sets, in this context, the 
words “maximal” and “maximum” are interchangeable. Similarly, if A contains a 
lower bound m, then m is the (unique) minimum element of A. 


The following theorem shows that any non-empty bounded subset of N must contain 
a maximal element with respect to “Ee”. 


Theorem 14.5 Any bounded non-empty subset S of (N, €) has a maximal element. 
Proof: 


Suppose S is a non-empty bounded subset of N with respect to the linear ordering “e”. 
Let Q be the set of all upper bounds of S. Since S is bounded, by definition, it has at 
least one upper bound and so Q # Ø. Since “c” well-orders N, Q must contain a “least 
element”, say k. We claim that k is a maximal element of S. To prove this claim it 
suffices to show that k is both an upper bound of S and belongs to S. Since k € Q, then 
k is an upper bound of S. We now show that k € S: Suppose k ¢ S. Let t be the unique 
immediate predecessor of k. That is, tt = k. Then if x € S, 


kg S => «Fk, 
=> «zeEk 
= gett 


gett => xe (tu {t}) 
=> xeéetorze {t} 
=> xetorx=t 


We have shown that for every x € S, x E= t; hence, t is an upper bound of S. That is, 
t € Q. But t € t™ =k where k was declared to be the least element in the set Q. This is 
a contradiction. Then k € S as claimed. So S has a maximal element. 


14.4 Constructing other well-ordered sets from N 


We have seen that the order relation “€” is a strict well-ordering of the set of all 
natural numbers, N. Now that we have given ourselves a large set to work with we 
will use the set N as a building block to construct other large well-ordered sets. We 
will introduce an order relation on various Cartesian products involving N. This par- 
ticular order relation is defined in terms of the ordering “€” on N. 
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Lexicographic ordering of the Cartesian product {1,2} xN. Consider the subset {1,2} x 
N= {(i,n):i=1 or 2,n€ N} of Nx N. We will order the elements of {1,2} x N by 
what is called a lexicographic ordering', denoted by <,-x. This means (a,b) <jex (c, d) 
ifa € cor, ifa = c, b € d. For example, (1,34) <,, (2,7) and (2,5) <iex (2,54). 
The elements of ({1,2} x N, <iex) can then be listed in a strictly increasing order as 
follows: 


{(1, 0), (1, 1), (1,2), (1, 3),- a , (2,0), (2, 1), (2,2), (2, 3),- aes) (3, 0), (3, 1),- >> .} 


The lexicographic ordering inherited from “€” is easily seen to be a linear ordering of 
{1,2} x N. We now investigate specific ordering properties of ({1, 2} x N, <:x). 


a) The order relation <, on {1,2} x N is a well-ordering. Consider a non-empty 
subset A of {1,2} x N. If there exists elements of the form (1,a) in A, then the 
least element of A is (1,m) where m is the least element of {n € N: (1,n) € A}, 
guaranteed to exist since € well-orders N. If all elements of A are of the form 
(2, b), then the least element of A is (2, m) where m is the least element of {n € N: 
(2,n) € A}, again guaranteed to exist since € well-orders N. Then ({1, 2} xN, <iex) 
is a well-ordered set. 

b) What are bounded and unbounded subsets of {1,2} x N with respect to <j, like? 
As examples we consider the following subsets of {1,2} x N. 

— If SC {1} x N, then any element which has the form (2,7) is an upper bound 
of S with respect to <jex 


— If T is a bounded subset of N, then there exists a natural number m such that 
t€ m for all t € T. Then (2,m) would be an upper bound of any subset of 
{1,2} x T C {1,2} x N. 

— There is no natural number n which is an upper bound of N. Then no element 
of {1,2} Nis an upper bound of {1,2} x N. In this case we say that the subset 
{2} x N is unbounded in {1,2} x N. 

c) On maximal elements of subsets of ({1,2} x N, <s). We have seen that every 
bounded subset S of N has a maximal element. Does every bounded subset of 
{1,2} x N contain a maximal element? The subset T = {(1,n) : n € N} of 
{1,2} x N is seen to have an upper bound (2,0). In fact, this element (2,0) is the 
smallest upper bound of T since any smaller element would be of the form (1,7). 
But since (2, 0) does not belong to T, then T does not contain a maximal element. 
So the lexicographically ordered set {1,2} x N possesses at least one ordering prop- 
erties which are not shared by N. 


Lexicographic ordering can be used to well-order other sets such as 


S = {0,1,2,3} xN CN XN 


1Some texts may refer to this as the “dictionary ordering”. 
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14.5 


This well-ordered set can be visualized as four copies of N lined up one after the 
other. The first copy is of the form {(0,n) : n € N} and the fourth copy is of the 
form {(3,n) :n € N}. The set N x N = {(a1, a2) : a1,a2 € N} can also be ordered 
lexicographically. When ordered in this way N x N can be viewed as {0} x N followed 
by {1} x N followed by {2} x N, and so on. 


Constructing non-well-ordered sets from N 


The sets we will now consider may seem less familiar to many readers who are not yet 
accustomed to viewing functions as “sets” of ordered pairs. We will in fact go a step 
further and discuss sets of functions whose domain is N. 


Let’s consider the set of all functions mapping N into {1,2}. This set is normally 
denoted as {1,2}. An element f of {1,2}% is a set of ordered pairs (n,m) where n 
takes on any number in N and m is either the natural number 1 or the natural number 
2. Furthermore, only one value m can be the image of a value n in N under a function 
f- We can express a function f : N —> {1,2} as the set 


f={(n,m)—eNxN: m= f(n) =1 or 2} 


where [(n,m) € f and (n, k) € f| > [m =k]. We see that f is a subset of N x {1,2} 
(not an element of N x {1,2}). That is, f € A(N x {1,2}). Then any specific function 
f in {1,2}% can be expressed as an infinite set of ordered pairs 


{(0, ao), (1, a4), 2; a2), (3, a3), ee ,} 


where a; = f(i) = 1 or 2. Notice that we have surreptitiously imposed a lexicographic 
ordering on the elements of f, since (n, Gn) < (Mm, am) if n < m. Actually we could 
more succinctly express this element f as a sequence 


{ao, Q1, 42, 43,..-. ,} 


where each a; = f (i) is the image of i under f.! So if f € {1,2}, f can accurately be 
described as an infinite sequence of 1’s and 2’s in an order dictated by the associated 
elements in the domain of f. This is the particular way we will view the elements of 
{1,2}\. That is, 

{1,2} = ({a}%o : a; =1 or 2} 


So comparing two functions f and g in {1,2} is essentially comparing two infinite 
sequences of 1’s and 2’s. Given the set S = {1,2} we want to define on order relation 
on S. The order relation that we will choose is inspired by the lexicographic ordering 
defined on subsets of N x N above. We formally define it below.” 


‘So a; is in fact shorthand for (i, f(z). 

?Even though the following definition of the ordering on the set {1,2} is inspired from the lexicographic 
ordering of sets of ordered pairs and adopts the notation <jex, it is good to remember that we are not 
ordering ordered pairs but sets which represent functions. 
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Definition 14.6 Consider the set {1,2} of all functions mapping natural numbers to 1 
or 2. We define the lexicographic order “<ie” on {1,2}% as follows: For any two elements 
f = {ao, a1, a2, a3,..., } and g = {bo, by, b2, bs,..., pin {1,2}, f <,, g if and only if for the 
first two unequal corresponding terms a; and b;, a; E€ b;. Also, f = g if and only if a; = bi 
for all i € N.? 


For example, if f = {1,2,2,1,1,---,} and g = {1,2,2,2,1,2,---}, then f <,, g. This 
ordering is easily seen to be linear. We now investigate basic ordering properties of 
(FE, OY <i): 

a) The relation <ie, is not a well-ordering of {1,2}N. At first glance, based on our ex- 
perience with lexicographic orderings, we may suspect that <,.,. well-orders {1, oN. 
But we should be cautious. Does {1,2} have a least element? The lexicographic 
ordering rule shows that no element in {1,2}‘ can be smaller than {1,1,1,1,---}. 
So {1,2}% at least has a smallest element. Let’s try to think about what its sec- 
ond smallest element is. We have listed the elements of a subset of {1,2} in the 


form of a strictly decreasing sequence of elements where each element is larger than 
{1,1,1,1,1,1,1,1,1,1,}: 


ASA RS AA 
Pe PPP Pb 
Pe PRP PNP 
Pe PRP NP 
Pe PNP RE 
Pe NP RR 
PNP RRR 
NP PRP Pe 
Pe eee Pe 
Pee Ree Pe 
fe ee a 
“v a a ed 


We see that each element is strictly less than its immediate predecessor in this 
list. We also see that we can never reach {1,1,1,1,1,1,1,1,1,1,---} using such a 
decreasing sequence. This convinces us that the set 


S={fe€ {1,2}: f > {1,1,1,1,1,1,1,--}} 


does not contain its least element since no matter where we insert our first “2” you 
will be able to insert a “2” further down. Since the non-empty subset S has no 
least element with respect to the ordering “<,.,”, then {1, oun is not a well-ordered 
set. 
The set {1,2} is bounded. We easily see that {2,2,2,2,---} is a maximal element 
of {1,2}. So every subset of {1,2} has at least {2,2,2,2,---} as upper bound. 
c) On maximal elements of bounded sets. Does every bounded subset of {1, 2} con- 
tain a maximal element? To help answer this question let’s try to find the element 


Z= 


3A lexicographic ordering can similarly be defined on SN where S is any subset of N. 
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14.6 


of {1,2}‘ which immediately precedes {2,2,2,2,---}. Another way of stating this 
is: What is the maximal element of S = {f € {1,2}: f < {2,2,2,2,---}}? This 
maximal element must have at least one “1” in it, along with as many 2’s as pos- 
sible. The question is where shall we insert this “1”? No matter where we insert 
this “1” we will be able to reconsider our choice and reinsert it farther down. So 
S contains no maximal element. 


The set NUZ., 


Having studied the set {1,2} of all functions whose domain is N and codomain is 
{1,2} we now consider the set of all functions with domain {1,2} and codomain N. 
This set is denoted as Nt!?}. Of course, if f € NU}, then f = {(1, a1), (2, a2)} where 
a, = f(1) € N and a = f(2) € N. See that we have purposely lexicographically 
ordered the two elements (1, a1) and (2, a2) of f. Just as for the functions in {1,2}% 
we can more succinctly represent f € Nt?) as f = {a 1, a2}, ordered doubletons of 
natural numbers. That is, 


Nib = {{a1, a2} : a1, a2 E N} 


is a set of ordered doubletons. Since these doubletons are ordered natural numbers 
we can lexicographically order the elements {a1, a2} of N{L2} as if N12} was a set 
of ordered pairs (a1, a2) of natural numbers. That is, a function ¢ : N x N => NÌ? 
defined as ((a1, @2)) = {a1, a2} can be used to lexicographically order the elements of 
Nt2} in such a way that the elements of NT}? are ordered in precisely the same way 
as the elements of N x N. This ordering on Nt!?} is also referred to as a lexicographic 
ordering and is also denoted by the same symbol, “<,,,”. So (N2, <iex) has order 
properties which are identical to those of (N x N, <iex). 


Concepts review: 


1. 


2. 


What does it mean to say that “<” strictly well-orders a set S? 


Describe two order relations which well-order N? 


. What does it mean to say a subset S of N ordered by “€” is bounded? 


. What does it mean to say that a non-empty subset S of N ordered by “e” has a 


maximal element? 


. Describe the set {1,2} x N by providing three distinct elements of this set. 


. Define the lexicographic ordering on {1,2} x N. 


132 


T. 


8. 
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10. 


11. 


12. 


13. 
14. 
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Define the lexicographic ordering on {1, 2}. 

List three elements of {1,2} in increasing order. 

Is the lexicographically ordered set {1,2} x N well-ordered? 
Is the lexicographically ordered set {1,2} well-ordered? 


Does every non-empty subset of the lexicographically ordered set {1,2} x N have a 
maximal element? 


Does every non-empty subset of the lexicographically ordered set {1,2}‘ have a max- 
imal element? 


Does {1,2} have a maximal element? 


Describe the elements of N{2}. Propose an ordering for its elements. 


EXERCISES 


1. Show that (N, €) does not contain a maximal element. 
. Can a non-empty bounded subset S of N ordered by “€” be an inductive set? Why 


or why not? 


. Construct three non-empty subsets of {1, 2}N each of which contains no least element. 


. Consider the lexicographically ordered set (N x N, <jex). 


a) Describe the first few elements of (N x N, <jex). 
b) Does N x N have a maximal element? What is it? 


c) Is (N x N, <x) a well-ordered set? If so show it. If not produce a non-empty 
subset which does not contain its least element. 


d) Does every bounded subset of N x N have a maximal element? Why? 


. Consider the lexicographically ordered set S = {1,2,3,---, g}N of all functions map- 


ping N to the set {1,2,3.--- ,9}. 

a) Describe the first few elements of S. 

b) Write the three elements f = {5,5,5,5,5,...,}, g = {4,9,2,2,2,...,} and h = 
{4,9,1,9,9,9,...} in increasing order. 

c) Does S have a maximal element? What is it? 


d) Is S a well-ordered set? If so show it. If not produce a non-empty subset which 
does not contain its least element. 


e) Does every bounded subset of S have a maximal element? Why? 
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6. Suppose we represent the set of all functions mapping {0,1,2} to N as 
Nt01.23 = {{ag, a1, a2} : a; € N} 


Suppose we order this set lexicographically. 

a) Describe the first few elements of N1012}, 

b) Write the three elements f = {4,0,600}, g = {600,9,8} and h = {6,9,53} in 
increasing order. 


Does N{®12} have a maximal element? What is it? 


SF b on 


Cc 


d) Is N{®1:2} a well-ordered set? If so show it. If not produce a non-empty subset 
which does not contain its least element. 


e) Does every bounded subset of NT®1:2} have a maximal element? Why? 
7. Consider the lexicographically ordered set N of all functions mapping N into N. 
a) Describe an element of NN as a set. 
b) Does NN have a least element? What is it? 
c) Does NN have a second element with respect to the lexicographic ordering? What 
is it? 
d) If f € NY, can f be viewed as a subset of N x N? Explain. 
e) If f € NN, can f be viewed as an element of A(N x N)? Explain. 
f) Can NN be viewed as a subset of A(N x N)? Explain. 
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15 / Arithmetic of the natural numbers. 


15.1 


15.2 


Summary. In this section we define addition, subtraction and multiplication 
of the natural numbers in a set-theoretic context. We then show that with these 
definitions, we obtain the expected results. Addition and multiplication on N are 
defined recursively. 


Highlights of what we have learned in set theory up to now. 


Now that we have defined the natural numbers within the framework of set theory it 
is a good time to look back on what we have learned and provide some insight on 
what is to come. 


In our theory, all objects are classes or sets. Since we are mainly interested in those 
objects called “sets”, our attention is directed towards these specifically. A few funda- 
mental properties of classes and sets are stated without proof in the form of axioms. 
Axioms are normally expected to be intuitively obvious to users. This does not mean 
that choosing axioms is done without debate, since what is obvious to one may not 
be obvious at all to another. We listed ten set-theoretic axioms referred to as the 
“ZFC-axioms”. The ZF-axioms refer to the first nine, while the “C” refers to the 
tenth axiom called “Axiom of choice”. The Axiom of infinity which postulates the 
existence of a set which satisfies the essential properties of the natural numbers is not 
an axiom which is “intuitively obvious”. It is however perceived as being essential 
since it provides a logical basis on which rests most of the mathematics we do today. 
The most controversial axiom is the Axiom of choice. We have not invoked this axiom 
yet. We will soon see why it is needed if we want our set-theoretic universe to unfold 
as we think it should. 


Once we gave ourselves classes, sets and a few axioms to work with, we gave ourselves 
the means to construct classes and sets from ones that exist (union, intersection, 
Cartesian products). This was followed by definitions of relations and functions both 
of which exist in our set-theoretic universe as sets. Finally, we defined the natural 
numbers so that they possessed the required well-ordering property. Of course, giving 
life to the natural numbers is just the beginning. Our next step is to appropriately 
define addition and multiplication of the natural numbers. Definitions must be such 
that results we obtain with these operations are what we expect them to be. Def- 
initions of the integers, the rational numbers and the real numbers will then follow. 
Finally, we will study infinite sets and their various properties. 


Defining addition recursively. 


Since natural numbers are sets, we may instinctively attempt to define addition of the 
natural numbers as follows: 3+5 = 3U5. But this doesn’t produce the desired result 
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since we know that 3 C 5 and so 3 +5 = 3U 5 would equal 5, not what we want at 
all. We see that viewing addition of natural numbers by simply joining sets together 
will not work, particularly if the sets have non-empty intersection. It may be that we 
are attempting to define too much at the same time. We should maybe try a step by 
step definition of addition. We will experiment with addition of natural numbers with 
the natural number “3”, specifically. Suppose we define addition of 0 to 3 as follows: 
3+0=3U9 = 3. Then we progressively define addition with 3 of successively larger 
and larger numbers. In what follows the symbol “:=” will serve as a succinct way of 
saying “is defined as”. 





























3+0 = 3 

3+1 = 3407 :=(3+0)" =3" =4 
3+2 = 3+1*:=(3+1)=4*=5 
3+n = m 

34n? c= (B3+n) =m" 


In this sequence of sums only the initial sum 3 + 0 = 3 is specifically defined while a 
globally defined rule “3 + n™ = (3 + n)™” is applied to evaluate the sum of all other 
natural numbers with 3. If we know the numerical value of 3 +n, then, by the general 
“rule”, 3 +n” has numerical value (3+ n)+. So if we want to determine the value of 
3 + 4 we first have to find the values of 3+ 1, 3 + 2 and 3+3.! 


3+4 = 




















It is a sure (albeit tedious) method of addition that will consistently produce the 
unique expected values for sums of two natural numbers. For example, once we have 
computed 3 + 4 = 7 we can then compute 3 +5 = (3+4)* = 7* = 8 by applying the 
algorithm 3+n* = (3+7n)*. Once all values of 3 +n are obtained we then obtain all 
values for 4+ n as n ranges over N, and so on. This simply shows that it is possible 


1This is reminiscent of the way we learned addition by using addition tables in elementary school: Before 
learning that 3+ 2 = 5, we learned that 3+ 0 = 3, 3+ 1 = 4 and deduced from this that 3 + 2 had to equal 
5. 
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to adequately define addition of natural numbers in a universe of sets in such a way 
that sums correspond to sums obtained by usual addition algorithms. This does not 
prevent us from using the various algorithms which allow us to obtain more efficiently 
the values of sums of natural numbers. 


We propose the following formal definition of addition of pairs of natural numbers. 


Definition 15.1 Let m be a fixed natural number. Addition of a natural number n with 
m is defined as the function rm : N —> N satisfying the two conditions 


Tm = m 


Tm(nt) = [rm(n)|* 
The expression m + n is another way of writing rm(n). Thus, 


Tm(0)=m & m+0=m (1) 
tm(nt) = [rm(n)t & m+n = (m+n) (2) 


For example, once the value of 34 + 0 = 34 is declared, the value of the sum 34 + 123 
is uniquely determined by applying the formula 34 + n* = (34+ n)* finitely many 
times to successively obtain 34+ 1 = 35, 34+2 = 36, 34+ 3 = 37,...,34+123 = 157. 


Readers may have no doubt noticed that the function rm(n) is defined by using a 
mechanism that we have not used or seen before in this text. We are accustomed to 
defining a function f : A — A by declaring a rule which associates to each element in 
A some other element in A without referring to other ordered pairs (a, f(a)) in f. For 
example, the only way we can confirm that the ordered pair (2,3+2) = (2,5) belongs 
to the function r3 is by first determining that (0,3 +0) = (0,3) and (1,3+ 1) = (1,4) 
also belong to r3. Most readers will intuitively feel that there is no ambiguity in the 
way we have defined the function rm on N. We refer to functions which are defined in 
this way as being recursively defined functions. If rm is indeed a well-defined function, 
then we must be able to prove that it satisfies the conditions stated in the formal defi- 
nition of a function. We remind ourselves how we defined a “function” (see Definition 
9.1): Given two sets A and B, a function is a subset f of A x B which satisfies the 
property “(x, y) and (x, z) belong to f implies y = z”. There is no reason to deviate 
from this understanding of functions. We will now formally show that this recursively 
defined function of addition satisfies the property described in the definition. That is, 
we will show that ifn =k, then rm(n) = m +n =m + k = rmk). 
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Theorem 15.2 Let m bea fixed natural number and let rm : N — N be a relation satisfying 
the two properties 
{ nO). =m 


Tm(n*) [rm(m)]* 


Then rm is a well-defined function on N. 


Proof: 
Let m be a fixed natural number. Let .Y be a class of relations on N defined as follows: 
SY ={RCNXN:(0,m)€ Rand (n,y) € R= (nt, y") € R} 


Now Z is non-empty since it contains N x N. Let r* = pey R. This means that r* is 
the smallest set of ordered pairs satisfying the conditions described for .Y%. The relation 
r* looks something like 


r* = {(0,m), (1, m”), (2, (mt)*), +++} 


We will show two things: 1) That r* is a function mapping N into N, and, 2) That 
rk = Tm. 


1) We claim that r* is a function mapping N into N. 


Proof of claim: 
We first establish (by induction) that dom r* = N. 


Base case: We first note that since (0,m) € r*, then 0 € domr*. 

Inductive hypothesis: Suppose n € domr*. Then (n,y) € r* for some y € N. This 
implies (n*,m*) € r* and so nt € domr*. 

Hence, by induction, the domain of r* is all of N. 


We now proceed to the proof of the claim. The proof of the claim invokes the sec- 
ond version of the principle of mathematical induction. Let P(n) denote the statement 
“i(n, x) E€ r* A(n,y) E€ r*] > [x = yl”. 
Inductive hypothesis: Suppose P(m) holds true all natural numbers m < n. That is, 
(m,x) € r* and (m,y) € r* implies x = y. We will show that given our hypothesis, 
P(n) must hold true. 
Suppose not. Suppose (n, x) € r* and (n, y) € r* where x 4 y. Let U = r*—{(n, y)} 
(the set r* take away the element (n, y)). Then U is still a relation belonging to Z 
which is strictly smaller than r*. But r* was previously declared to be the smallest 
of the relations in .Y%. We have a contradiction whose source is our assumption that 
x #y. Then z must be equal to y. We conclude that P(n) holds true as required. 


By mathematical induction (version two) P(n) holds true for all n. 
So r* is a well-defined function as claimed. 


2) We now claim that the function r* is the relation rm as defined above. 
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Proof of claim: 

The proof is by induction. Let P(n) denote the statement “r*(n) = rm(n)”. 

Base case: The statement P(0) holds true since r*(0) = m = rm(0) = m +0. Inductive 
hypothesis: Suppose P(n) holds true. That is, suppose (n, rm(n)) € r*. Then, by defini- 
tion of r*, (nt, rm(n)*) € r*. Since rm(n™) = rm(n)t, then (nt, rm(n*)) € r*. That is, 
r*(n*) = rm(n*). So P(n”) holds true. 

By mathematical induction r*(n) = rm(n) for all n. 

Then r* = rm as claimed. 

We conclude that rm and r* are indeed the same relation. Since r* was shown to be a 
function, the recursively defined relation rm : N — N is a well-defined function. 


15.3 Basic properties of addition. 
We must now be sure that the addition operation we have defined on N satisfies the 
basic properties of addition we are accustomed to. 


1) For every natural number n, nt andn+1 are the same number. We are required to 
show that assuming 0* is denoted by the symbol 1, then for any non-zero natural 
number n, 

nt =l+n 
This can be shown by induction: 


Proof: By induction. Let P(n) be the property “nt = 1 +n” (where 1 = 0°). 
Base case: We see that P(0) holds true since 
ro+(0) = or + 0= or (By (1) in the definition of addition above). 
or =r éf + 0 (By notation). 


Inductive hypothesis: Suppose P(n) holds true for some n. Then 
Gat = (1 + n)t =n]; + nt (By (2) in the definition of addition above). 


So P(n”) holds true. By mathematical induction nt = 1 +n for all natural num- 
bers n. 


2) For any natural number n, 
O+n=n 

Proof: By induction. Let P(n) be the property “O+n =n”. 
Base case: We see that P(0) holds true since r9(0) = 0+ 0 = 0, by (1) in the 
definition of addition above. 
Inductive hypothesis: Suppose P(n) holds true. Then 0+ nt = (0 +n)® = n+, by 
(2) in the definition of addition. So P(n*) holds true. By mathematical induction 
0+n=n for all natural numbers n. 
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3) 


Addition of the natural numbers is associative. That is, for any three natural num- 
bers m,n and k 


(m+k)+n=m+4(k+n) 


Proof. By induction. Let m and k be any two natural numbers. Let P(n) be the 
property “(m+k)+n=m+(k+n)”. 

Base case: We see that P(0) holds true since (m+ k)+0=m+k=m+(k+0), 
(by (1) in the definition of addition above). 

Inductive hypothesis: Suppose P(n) holds true. Then 





(m + k) + nt = (m + k) + n] | (By (2) in the definition of addition above.) 
= [m + (k + a) (Since P(n) holds true.) 
= mt+(k+n)* 
= m+(k+n*) 


So P(n*) holds true. By mathematical induction (m+k)+n = m+(k+n) for all nat- 
ural numbers n. Since m and k were arbitrarily chosen, then (m+k)+n = m+(k+n) 
holds true for any three natural numbers m,n and k. 


Addition of the natural numbers is commutative. That is, for any two natural 
numbers m and n 


mt+tn=n+m 


Proof. By induction. Let m be any natural number. Let P(n) be the property 
“m+tn=n+m”. 
Base case: We see that P(0) holds true since 


m+ 0 =m (By (1) in the definition above.) 


= 0 +m (By Property 2). 


Induction hypothesis: Suppose P(n) holds true. Then 


m + nt = (m + n)t (By (2) in the definition above.) 


= (n + m)” (Since P(n) holds true.) 
= 1 + (n + m) (By property 1).) 
= (1 + n) + M. (By property 3).) 


= nt +m (By property 1).) 


So P(nt) holds true; by mathematical induction m + n = n +m for all natural 
numbers n. Since m was arbitrarily chosen, then m+n = n+m holds true for any 
pair of natural numbers m and n. 
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15.4 Definition of multiplication on N. 


As for addition, multiplication will be inductively defined. 


Definition 15.3 For any natural number m, multiplication with the natural number m is 
defined as the function sm : N — N satisfying the two conditions 


Sm(0) = 0 


Sm(n*) = sm(n)+m 


We define the expression mn and m x n as alternate ways of writing s,,(n). Thus 





Sm(0) = 0 m0=mx0=0 (3 
+ 


WS 





È 


SS 


Sm(N") = Sm n) +m & mnt =mnt+m=mxn+m (4 


Theorem 15.4 Let m be a fixed natural number and let sm : N — N be a function 
satisfying the two properties 


Then spm is a well-defined function on N. 


Proof: (Outline) 
Let .Y be a class of relations on N defined as follows: 
SY ={RCNxXN: (0,0) € Rand (n,y) € R= (n*,y+m) € R} 


Now Z is non-empty since it contains N x N. Let s* = (lge. R. This means that s* is 
the smallest set of ordered pairs satisfying the conditions described for Z. The relation 
s* looks something like 


s* = {(0,0), (1, m), (2, m+ m), (3,m+m+m),---,(n,mxn),--+,} 


Claim: s* is a function mapping N into N. Proof of the claim is left as an exercise. 
Claim: s* satisfies the properties which characterize Sm. Proof of the claim is left as an 
exercise. 


We conclude that Sm and s* are indeed the same relation. Since s* was shown to be a 
function, the recursively defined function sm : N — N is well-defined. 


We now verify that the expected properties of multiplication are satisfied. 
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1) 


For any natural number 
On = 0 


Proof: By induction. It is left as an exercise. 


For any natural number n, 
In=n 
Proof: By induction. It is left as an exercise. 


Multiplication of the natural numbers is distributive over addition. That is, for 
any three natural numbers m,n and k 


n(m+k)=nm+nk and (m + k)n = mn + kn 


Proof: By induction. Outline of proof for left-hand distribution. Right-hand 
distribution is left as an exercise. Let m and k be any two natural numbers. Let 
P(n) be the property “k(m +n) = km + kn”. 
Base case: We see that P(0) holds true since 


k(m +0) = km 
= km+0 
= km+k0 


Inductive hypothesis: If P(n) holds true, then 


k(m+nt) k(m+n)t 


k(m + n) + k (By 2) in the definition.) 


= km + kn + k (Since P(n) holds true.) 


— km + kn? (By 2) in the definition.) 


So P(n*) holds true. By mathematical induction k(m+n) = km+ kn for all nat- 
ural numbers n. Since m and k were arbitrarily chosen, then k(m+n) = km+kn 
holds true for any three natural numbers m, k and n. 


Multiplication of the natural numbers is associative. That is, for any three natural 
numbers m, n and k, 

(mn)k = m(nk) 
Proof: By induction. It is left as an exercise. 


Multiplication of the natural numbers is commutative. That is, for any two nat- 
ural numbers m and n, 
mn = nm 


Proof: By induction. It is left as an exercise. 
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15.5 Subtraction on the natural numbers. 


Subtraction is easily defined in terms of addition. To define subtraction we must first 
establish the following fact. 


Theorem 15.5 For any two natural numbers m and n, m E€- n if and only if there exists 
a unique natural number k such that n = m + k. 


Proof: 


By induction. Let P(n) be the statement “For any m €_ n there exists a unique natural 
number k such that n = m + k. 

Base case: Suppose n = 0. Then, for any m €_ 0, m = 0 and so there exists only k = 0 
such that 0 = n = m + k = 0 +0. So P(0) holds true. 

Inductive hypothesis: Suppose n is a natural number such that for any natural number 
m E- n, there exists a unique natural number k such that n = m + k. Suppose m is a 
natural number such that m €- nt. Then m E- nt =n U {n} implies m € n or m=n 
or m = nt. The equality m = n* means we can only choose k = 0. If m € n, then 
the existence of a unique natural number k such that n = m + k is guaranteed by our 
inductive hypothesis. So 


n = n+1 

= (m+k)+1 
m+(k+1) 
m+ k? 








In this case the required natural number is kt. If m = n, then nt = kt = k+1=m+1. 
The unique required value is k = 1. So P(n*) holds true. 


By mathematical induction P(n) holds true for all values of n. 


Definition 15.6 For any two natural numbers m and n such that m < n, the unique 
natural number k satisfying n = m + k is called the difference between n and m and is 
denoted by n— m. The operation “—” is called subtraction. 


With respect to the basic operations of addition, subtraction and multiplication on 
the natural numbers, our work is done. These operations have been shown to be 
properly defined in our ZFC-axiomatic universe. Again the intention is not to adopt 
the formal definitions as regular methods for doing arithmetic. It is only to ensure 
that arithmetic is definable in a set-theoretic context. 


A. 


Part V: From sets to numbers 143 


Concepts review: 


l; 


2: 


How is addition on the natural numbers defined? 


For any natural number n give two ways of describing n”. 


. How can we prove that 0+ n = n for all n from the definition of addition? 


. How can we prove that addition is associative from the definition of addition? 


. How can we prove that addition is commutative from the definition of addition? 


. How is multiplication of natural numbers defined? 


. How is subtraction of natural numbers defined? 


EXERCISES 


1. Use mathematical induction to prove the following multiplication properties. 


o Fe WW N 


D 


a) For all natural numbers n, 0n = 0. 
b 
c 


d 


e 


) For all natural numbers n, In = n. 

) For all natural numbers m, n and k, n(m+k) = nm+nk and (m+k)n = mn+kn. 
) For all natural numbers m, n and k, (mn)k = m(nk). 

) 


For all natural numbers m and n, mn = nm. 


. Prove that if n < k, then m+n <m +k. 

. Prove that if m + n = m + k implies n = k. 

. Prove that if m < n, then mk < nk. 

. Prove that if mk = nk and k £ 0, then m =n. 


. Prove that m +k < n + k implies m < n. 
. Prove that mk < nk implies m < n. 
. Prove that for any two natural numbers m and n, m < n if and only if there exists 


a unique natural number k such that n = m + k. 


. Prove in detail theorem 15.4. 
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16 / The integers Z and the rationals Q. 


16.1 


Summary. In this section we define both the integers, Z, and the rational 
numbers, Q. The integers are presented as a quotient set of N x N while the 
rationals are presented as a quotient set of Z x (Z—{0}). Addition, subtraction 
and multiplication on each of these are defined within the set-theoretic context. 
Order relations are defined on each of Z and Q so that they are linearly ordered 
in the way we are accustomed to. 


Constructing the set of integers Z from N. 


Most people easily recognize an integer when they see one. One might say, given the 
natural numbers N = {0,1,2,3,...,}, if we add to it all the “negative” natural num- 
bers {—1, —2,—3,...,} we obtain all integers. We are then only left with the sticky 
problem of explaining what a “negative” natural number is within our set-theoretic 
framework. 

Remember that the only mathematical objects in our set theoretic universe are sets. 
So the integer —3 must be a set of some sort. But the idea of a “negative set” is not 
very intuitive. One might say that —3 is the difference between the natural numbers 
2 and 5. But in the last section, the difference, n — m, between natural numbers was 
only defined for pairs (n,m) where the second natural number is less than or equal to 
the first one. So the expression 2 — 5 is, as yet, not defined. 


What do the ordered pairs (5,2), (10,7) and (3,0) have in common? We notice that 
the first entry minus the second entry is 3 for each of the pairs. If we consider, on the 
other hand, the pairs (1,5), (6,10) and (0,4) we see that in each case, the first entry 
minus the second entry is —4. This suggests that an equivalence relation of some sort 
on N x N may provide a useful way of representing negative integers. We explore this 
avenue to see where it leads us. 


Theorem 16.1 Let Z = N x N. Let R, be a relation on Z which is defined as follows: 
(a, b) R.(c, d) if and only ifa+d=6+c. Then R, is an equivalence relation on Z. 


Proof: 


Reflexivity: Since a+ b = b + a, (a,b) R.(a, b). 

Symmetry: (a,b)R,(c,d) > a+d=b+c>c+b=d+a> (c, d)Rz(a,b). 

Transitivity: Suppose (a, b)R,(c, d) and (c, d)R.(e, f). Then a+d = b+c and c+ f = d+e. 
This implies a+ d +c+ f =b+c+d+e. Subtracting c + d from both sides of the 
equality gives a + f = b +e. Hence, (a, b)R.(e, f). 
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Notation: If Ris an equivalence relation on S and x € S, then we will use the notation 
[z]r 


to denote the equivalence class of all elements equivalent to x under R. When the 
context indicates which relation we are referring to and there is no risk of confusion, 
we will simply write |x] instead of [z]p. 


Corollary 16.2 Let Z = N x N be equipped with the equivalence relation R, defined as: 
(a, b)R.(c,d) < a+d=b+c 


for each n € N, let [(0,n)] and [(n,0)] denote the R,-equivalence classes containing the 
elements (0,n) and (n,0) respectively. Then the quotient set induced by R; on Z can be 


expressed as Z/Rz = {[(0,n)]:n E€ N}U {[(n, 0)] : n € N} 
Proof: 


We will start by showing that the equivalence classes in Z/R, cover all of Z. That is, we 
will show that 


ZCU=|[J[O,n)] u (ia o] 


neN neN 


Let (a,b) € Z. We will show that (a,b) € U. Suppose (c,d) € [(a,b,)]. Then 
(a, b)R-(c, d) =a +d = b+ c. We consider two cases, d < c and c < d 


d<c > a+d-d=b+c-d 
=> a+0=b+(c-d) 
(a, b)R-((c— d), 0) 
at+d—c=b+c-—c 
a+(d—c)=6b+0 
(a, b)R-(0, (d—c)) 


UUL 


So if (a,b) € Z, then either (a,b) € [((c — d),0)] or (a,b) € [(0,(d — c))]. Thus, 
Z C U = Unenl(0,2)] U Unen[(r,0)]. So every element of Z = N x N is an ele- 
ment of some equivalence class in Z7/R,. 


Next we show that if m and n are distinct, then (0, n) and (0, m) are not related with 
respect to Rz: 


mAn > 0+nAm+0 
= (0,m) ¢ [(0, n)] 
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Since R; is reflexive, (m,0) ¢ [(n,0)]. We show that the elements of Z/R, with dis- 
tinct representatives do not overlap: For if m 4 n and (x,y) € [(0,m)]/M [(0, n], then 
(0, m)R-(x, y) and (x, y)R-(0, m) implies (0, m)R,(0, n) (by transitivity), a contradiction. 
So the sets in {[(0,n)] : n € N} U {[(n, 0)] : n € N} represent all equivalences class of Z 
induced by R,. 


We have set the stage for a set-theoretic definition of the “integers”. Some readers 
may have some insight on where this is leading. It seems that the plan is to have the 
equivalence class [(0, n)| represent the negative integers 0 — n = n and [(n,0)] repre- 
sent the positive integers n — 0 = n. We can equate —5 with [(0,5)] and the integer 
5 with [(5,0)]. Some may immediately wonder: Why do the positive integers need 
defining? Aren’t positive integers just the natural numbers? How can the natural 
number 5 = {0, 1,2,3,4} be the same set as the integer 5 = [(0,5)]? These two sets 
are indeed different since they don’t contain the same elements. It is true, the “nat- 
ural number 5” and the “integer 5” have different set representations. The question 
is: Is this a major problem or is it just a minor annoyance? It may be possible to 
construct the integers with the specific requirement that the sets which represent the 
positive integers and the sets which represent the natural numbers be the same. But 
this constraint may present some hurdles around which it may be difficult to maneu- 
ver. When we think about it carefully, it is not the sets which represent the natural 
numbers and the sets which represent the positive integers that are important. What 
is however crucial is that the arithmetic operations on these sets each produce the 
expected values. That is, both 5 + 3 and [(5,0)]+ [(3,0)] produce 8 “the natural 
number” and 8 = [(8,0)] “the positive integer” respectively. With this in mind we 
proceed with a formal definition of the integers. 


Definition 16.3 The set of integers, Z, is defined as: 
Z = 2/Rz = į{[(a,b)] : a,b € N} = {[(0, n)] : n € N} U {[(n, 0)] : n € N} 
a) Negative integers: The set of negative integers is defined as being the set 
Z- = {[(0,n)]: n € N} 
Positive integers: The set of positive integers is defined as being the set 
Z* = {[(n,0)]: n € N} 


If n is not 0, the elements of the form [(0,n)] can be represented by —n = [(0,n)] 
while the elements of the form [(n,0)] can be represented as n = [(n, 0)]. 


b) Order relation on Z: We define a relation <, on Z as follows: [(a,b)] <- |(c, d)]| if and 
only ifa +d < b+c. It is a routine exercise to show that <, is a linear ordering of Z. 
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c) Addition on Z: We must sometimes distinguish between addition of natural numbers 
and addition of integers. Where there is a risk of confusion we will use the following 
notation: “+,” means addition of natural numbers while “+,” means addition of 
integers. 


Addition +, on Z is defined as: 
[(a, b)] +2 [(c, d)] = [(a +n c, b +n d)] 


d) Opposites of integers: The opposite —[(a, b)] of [(a, b)] is defined as 
—[(a, b)] = [(b, @)]? 


(4 ” 


e) Subtraction on integers: Subtraction “—,” on Z is defined as: 
[(a, b)] -z [(e, d)] = [(a, b)] + (-[(e, d)])? 
f) Multiplication of integers: Multiplication x, on Z is defined as 
[(a, b)] xz [(e, d)] = [(ac + bd, ad + be)].3 
In particular, [(0,7)] xz [(m,0)] = [(0 + 0,0 + nm)] = [(0,nm)] = —[(mm,0)] and 
[(n, 0)] xz [(m, 0)] = [(nm, 0)]. 
g) Absolute value of an integer: The absolute value, |n|, of an integer n is defined as 


n= n if O<n 4 
~ -n if n<,0 


h) Equality of two integers: If (a,b) and (c,d) are ordered pairs which are equivalent 
under the relation R,, then the R,-equivalence classes |(a,b)] and [(c,d)] are equal 
sets. To emphasize that they are equal sets under the relation R, we can write 


[(a, b)] Th [(e, d)] 
i) Distribution properties: If |(a,b)], [(c, d)] and [(e, f)] are integers, then 
[(a, b)] xz (Ie, d)] +2 [(e, f)]) =z [(a, b)] xz [(c, d)] +2 [(a, b)] xz [(e, f)] 


and 


(e, d)] +2 [(e, PI) xz [(a, b)] =z [(c, d)] xz [(a, b)] +z [(e, f)] xz [(a, b)] 


‘Note that —[(n,0)] = [(0, n)] n. 

?When there is no risk of confusion with subtraction of other types of numbers we will simply use 

3Note that the “center dot” can be used instead of the “x symbol. When there is no risk of confusion 
with multiplication of other types of numbers we will simply use “x”. 

“View “absolute value” as a function | |: Z > Z. 








“un 
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It is good to remember that any integer can be written in the form [(n, 0)] or [(0, n)] = 
—[(n,0)]. These forms make it easier to add and multiply them without memorizing 
intricate formulas. For example, the expression |(2,4)] x+ [(5,2)] can be more easily 
computed as follows: 


[(2, 4)] Xz [(5, 2)] = [(0, 2)] Xz [(3, 
2,0 


I 
| 
x 
XR 
— 
—~ 


We verify that the product of two non-negative integers produces a positive integer 
as it should. 


[(2, 4)] Xz [(2, 5) =z [(0, 2)] Xz [(0, 3)| 
=, [(0+6)+0] 
=z [(6, 0)] 


Remark: We pause to deconstruct the elements of Z to better see the nature of the 
sets that belong to it. Let u € Z. Then u = [(a,b)| for some a,b € N. By lemma 4.5, 
(a,b) E NxN E A(A(N)). Since [(a, b)]| c A(A(N)) then [(a, b)] E A(A(A(N))) = 
P?(N). We conclude that Z C P3(N). 


Constructing the rational numbers Q from Z. 


We have succeeded in “extracting” the integers Z from N x N by constructing a quo- 
tient set induced by a particular equivalence relation on N x N. To construct the 
rationals we will proceed in a similar way. 


When looking at a rational number, 7, it may be useful to view it as an ordered pair 
of integers (a, b) of integers where the first entry plays the role of the numerator while 
the (non-zero) second entry plays the role of the denominator. But simply defining a/b 
as an ordered pair (a,b) in Z x Z would not do, since a rational number, say —2/3, can 
have many equivalent forms: 2, =, 2. So the associated ordered pairs of integers 
(—2, 3), (—4, 6) and (20, —30) should also be equivalent forms of the same number. To 
overcome this difficulty we will define an equivalence relation on Q = Z x Z* (where 
Z* = Z — {0}) so that all equivalent forms of (—2,3) belong to an equivalence class 
[(—2, 3)] induced by this equivalence relation. We have chosen the Cartesian product 
Z x Z* rather than Z x Z since the second entry cannot be zero. The equivalence 
relation we will use to extract Q from Q = Z x Z* will be represented by R,. To 
define this equivalence relation R we will ask ourselves: What property makes the 
two rational numbers —2/3 and = equivalent? We see that 
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—2 lo cf : 
ap implies (—2)(—12) = (3)(8) 


More generally we see that 


a c, ; 
B= qifand only if ad = bc 


We want the ordered pairs (a, b) and (c, d) in Z x Z* to be related under R4 provided 
they satisfy the property a x, d = b x, c. Proving that this is a valid equivalence 
relation is routine. It is formally stated as a theorem. 


Theorem 16.4 Let Q = Z x Z* where Z* = Z — {0}. Let Ry be a relation on Q defined 
as follows: (a, 6) R,(c, d) if and only ifax,d = bx zc. Then Rg is an equivalence relation on Q. 


Proof: The proof is left as an exercise. 


For example, consider the two elements (6, 10) and (15, 25) of Z x Z*. Since 6 x, 25 = 
150 = 10 x, 15, they are equivalent rational numbers. So [(6,10)]~ =; [(15, 25)]g. 
Remember that [(a, b)], will represent the set of all elements of Z x Z* which are 
R,-equivalent to the element (a,b) € Z x Z*. When there is no risk of confusion with 
the equivalence class of another equivalence relation we will simply use [(a, b)] rather 
than [(a, ]¢. 


We now formally define the rational numbers within a set-theoretic context. 


Definition 16.5 The set of rational numbers, Q, is defined as: 
Q = Q/R; = {[(a,b)] : a € Z,b € Z}} 
The expression [(a, b)] is normally written in the form $. 


a) We define a relation <q on Q as follows: If b and d are both positive, [(a, b)] <q [(c, d)] 
if and only ifa xz d <; b Xz c. 


b) Addition +, on Q is defined as: 


[(a,b)] +4 [(c, d)] = [(ad +, bc, b x, d)] 


c) Subtraction —, on Q is defined as: 


[(a, b)] —4 [(c, d)] = [(a, b)] +4 [(—e, d)] 


‘Recall that a and —b is shorthand for expressions of the form [(a,0)] or —[(0,b)] 
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d) 


e) 


=R 
Ww 
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Multiplication x, on Q is defined as 


[(a, b)] x4 [(c, d)] = [(a xz c, b xz d)] 


Equality of two rational numbers: If (a,b) and (c,d) are ordered pairs of integers 
(where neither b nor d is 0) which are equivalent under the relation Rq, then the 
R,-equivalence classes |(a,b)] and [(c, d)] are equal sets. To emphasize that they are 
equal sets under the relation Ry we can write 


[(a, b)] =¢ [le d)] 


Opposites of rational numbers. If (a,b) is an ordered pair of integers (b # 0) and 
[(a, b)] is its Rọ-equivalence class, then the opposite of the rational number [(a, b)] is 
defined as [(—a, b)] and is denoted as —[(a, b)] =; [(—a, b)]. 


Proofs that addition, subtraction, multiplication and linear ordering, thus defined, 
reflect precisely what we normally obtain when performing the usual algorithms on 
Q is left as an exercise. Once this is verified, the reader is, of course, free to use the 
usual algorithms for computation involving rational numbers. When convenient, we 
will interchangeably represent Ry equivalence classes [(a, b)] as ¢ and vice-versa. 


Remark: We pause to deconstruct the elements of Q to better see the nature of the 
sets that belong to it. Recall that Z C #(N). Let u € Q. Then u = [(a, b)] for some 
a,b € Z. By lemma 4.5, (a,b) € Z x Z* € P(A(Z)). Since [(a, b)] C A(A(Z)) then 
[(a, d] € P(PA(A(Z))) = P?(Z). We conclude that Q C #3(Z) C P?(P3(N)) = 
P(N). 


Theorem 16.6 Suppose a and b are positive integers where b # 0 and [(a,b)] is an Ry 
equivalence class. Then 


a) 
b) 


[(—a, —b)] = = = § = [(a, b)]. 
—[(a, 6)] =q [(—a, 6)] = 7 = = [(@, —b)] 


Proof: 


a) Since —a x, b = —b x, a, then (—a, —b)R,(a,b). Then (—a,—b) € [(a, b)] and so we 
can write = = [(—a, -b)] =, [(a,0)] = $ 


b) Note that if a and b are positive integers, then since —a x, —b = b x, a, then (—a, b) 
is R,-equivalent to (a, —b) and so 


—a a 


F = [(-a, b)] = -[(2, 8)] = Ka, =) = 5 
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Hence, moving negative signs from numerators to denominators and vice versa in rational 
numbers is justified. 


Is an integer equal to a rational number in this set-theoretic context? To help answer 
this question we will compare a particular integer to its equivalent rational number 
form. In our set-theoretic axiomatic system, the integer —3 looks like [(0,3)], = 
—|(3,0)], C N x N. In the same axiomatic system the rational number —3/1 looks 
like [(—3, 1)]q C Z x Z*. So, they are not the same set. However, it can be verified to 
one’s satisfaction that operations made with integers n = +|(n,0)], when viewed as 
rational numbers [(n, 1)]q will provide results which consistently match those obtained 
by performing parallel operations with integer numbers.! 





Examples. 


The following examples illustrate that doing arithmetic by referring to the set-theoretic 
definitions and the listed properties is a bit awkward and requires some thought. It 
is of course not an efficient way of doing arithmetic. To see this we compute the 
following expressions by representing these numbers as equivalence classes of ordered 
pairs and using the above definitions. When useful we indicate which of the above 
statements are invoked to justify various steps. 


a) Compute —4(3— 7) b) Compute + (£ — #) 


Solution: 


7,0)]) 
—7, 0)]) (By theorem 16.6, b) .) 
) 


0, 7)] 
0 





1 Just as a matter of interest let’s see how our number systems are evolving in our set-theoretic universe: 
Natural numbers N: Exists by Axiom 8. 
The integers Z: n € Z > n = [(n,0]- C N x N > n = [(n,0]- E P(N x N). Then ZC P(N x N). 
The rational numbers Q: a/b € Q = a/b = [(a,)]g C Z x Z* = a/b = [|(a,b]- € A(Zx Z) = a/b € 
P(P(N xN) x P(N xN) > Qc A(A(N x N) x P(N xN)). 
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SE-B) = 25x (6. D- 610) 
a (2,5) xa (U6. D) +a (3,111) ey tom 6,9 9 
=, [(-2,5)] xq [(6x21l +2723, 7X21) 
=a [(-2,5)] x4 [(45,77) 
=, [(-2x, 45, 5 x,77)] 
=, [(—90,385)] 
=, (18,77) 
=, -[(18,77) 
18 
= -7 


Concepts review: 


1. Describe the equivalence relation R; on N x N used to define the elements of the 
integers Z. 


2. Describe the equivalence class induced by R; on N x N which represents the integer 
—9. What about the integer 3? 


3. Do the equivalence classes {[(0,7)] : n € N}U {[(n,0)] : n € N} account for all the 
equivalence classes induced by R; on N x N? 


4. How is addition +, defined on Z in a set-theoretic context? 
5. How is multiplication x, defined on Z in a set-theoretic context? 


6. Describe the equivalence relation Rg on the Cartesian product Z x Z* used to define 
the rational numbers Q. 


7. How is addition +, defined on Q in a set-theoretic context? 


8. How is multiplication x, defined on Q in a set-theoretic context? 
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EXERCISES 


1. Use the definitions in 16.3 to show that the following statements are true: 


a) —2 < 10. 

b) 0<3. 

c) -5-7=-12 

d) -2+6=4 

e) 7 x —2 = —14 

f) -1 x —2 = 2. 
2. Use the definitions in 16.5 to show that the following statements are true: 

a) 4 = 3. 

b) B<h 

c) 34+3=2%. 

TETE 

e) 8x5 =2. 

f) 3s. 


3. Let Z = NxN. Let R, be a relation on Z that is defined as follows: ((a, b), (c, d)) € R 
if and only if a+ d = b + c. Prove that R, is an equivalence relation on Z. 

4. The relation <, on Z is defined as follows: |(a,b)] <+ [(c, d)] if and only if a+d < b+c. 
Show that this is a linear ordering. 

5. Let Q = Z x Z* where Z* = Z — {0}. Let R4 be a relation on Q defined as follows: 
((a,b), (c,d)) € Rq if and only if a x; d = b x; c. Show that Ry is an equivalence 
relation on Q. 


6. We define a relation <q on Q as follows: When both b and d are non-negative, 
[(a, b)] <q [(c, d)] if and only if a x,d < b xz c. Show that this is a linear ordering. 
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17 / Dedekind cuts: “Real numbers are us!”. 


Summary. In this section we show how R is defined within the confines of 
the ZFC-set-theoretic axiomatic system. With this objective in mind we begin by 
defining “initial segments” of rational numbers. These special elements of P(Q) 
are referred to as “Dedekind cuts”. A linear order relation is defined on these 
as well as operations of addition and multiplication. We then show that there is 
a function, f, which maps the real numbers R to the Dedekind cuts one-to-one 
and onto while respecting order, addition and multiplication. Finally we show 
that the set of all Dedekind cuts satisfies the essential Completeness property of 
the real numbers. 





Defining the real numbers R. 


Numbers such as 2 and a were known to be “not rational” long before earnest at- 
tempts were made to define the real numbers. Defining a real number by stating that 
“it is a set of numbers some of which are not rational numbers” is not satisfactory, 
since today we know of numbers which are neither real numbers nor rational num- 
bers.! Making a list of the basic properties which define the real numbers and then, 
postulating the existence of a set which satisfies these properties is a possibility. But 
a new axiom is not absolutely necessary since, as we shall soon see, the existence of 
real numbers can be proven by only invoking the set theory axioms we have seen up 
to now. At this point, we have invoked, at some point in our study, all of the axioms 
numbered 1 to 6 and 8. Axiom 8 is the Axiom of replacement, Axiom 9, called the 
“Axiom of regularity” and the tenth axiom called the “Axiom of choice” have not yet 
been invoked; we will see that these two axioms are not required to construct the set 
of all real numbers. 


The main challenge we encounter when studying those real numbers which are not 
rationals is that we cannot “see” them directly, even though there is evidence that 
they exist as numbers. We can see the rationals and easily order them on a num- 
ber line. For example, the number 2/5 can be seen by subdividing the line from 0 
to 1 into five equal subintervals; the number 2/5 is the right endpoint of the sec- 
ond subinterval. Any attempt to “see” v2 by subdividing a line interval in this way 
will fail, even though this number is a solution of the equation x? — 2 = 0, and we 
have a approximate idea of where it sits on the number line. Thousands of years 
ago, the Greeks knew that the length of the diagonal line of a one by one square was 
not a quotient of natural numbers and so could not be measured in conventional ways. 


"We are referring here to the “complex numbers”. 
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It is true that the real numbers are, in many respects, similar to the rational numbers. 
The set Q is one which is linearly ordered so that we can associate to any pair of dis- 
tinct rationals a and b satisfying a < b a third rational number c such that a < c < b. 
The set R also has a linear ordering which satisfies this property. But there is a 
property which distinguishes the reals R from the rationals Q in a fundamental way. 
This property is called “the completeness property’ 1. It states that “Every non-empty 
bounded subset S of R has a least upper bound which is in R”.? The set of all ratio- 
nal numbers Q does not satisfy this property. For example, the set S of all rational 
numbers which are strictly less than v2 is bounded above; but the non-empty subset, 
Q- S, of all upper bounds of S has no “least upper bound” (or if one prefers, no min- 
imal element) which is contained in Q. That is, for any rational upper bound q of S 
which is larger than V2, we can find a smaller upper bound rational number t such that 
V2 <t <q. We mention this now, since we will be attempting to identify sets which 
can represent the elements of R in a universe governed by ZFC. Contenders will have 
to successfully pass the “completeness property” test. We will come back to this soon. 


A very elegant and clever set-theoretic definition of the real numbers was put forward 
by the mathematician Richard Dedekind (1831-1916). Dedekind identified a subset 
of Y(Q) which successfully represents the set IR. We first express a few preliminary 
ideas that may have led Dedekind to formulate his definition of R. 


17.2 Projecting R into Y(Q). 





In what follows, we will construct a one-to-one function which will map R into A(Q) 
while respecting the order and the basic algebraic properties of the real numbers. We 
will begin with the definition of special subsets of R and Q. 


Definition 17.1 For any real number r, let (gr) denote the interval (—oo,r) in R. It is 
the subset of all real numbers strictly smaller than r. The subset (1) is called an initial 
segment in R. For any real number r, the subset (—r) of Q is defined as follows: 


(ar) = (—0o, r) NQ= (rr) NQ 


The subset (or) of Q is called an initial segment in Q. For both (gr) and (gr) the real 
number r will be referred to as the leader of the initial segment. Note that r may be an 
irrational number even for (gr) C Q.” The number r is also seen to be the least upper 
bound of (or) in R.A 


1Tt is also referred to as the “Least upper bound property”. 

2We can also say: The set U of all upper bounds of the subset S of R has a minimal element which is in 
R. 

3For example, (=o /2) is a subset of Q whose leader is the irrational number V2. 

“The reader will recall that m is an upper bound of a set S if s < m for all s in S and t is the least upper 
bound of S if t is an upper bound of S and t < u for all upper bounds of S. 
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For each real number r, (7) is a subset of Q and therefore is an element of P(Q). 
So the set 


2 =4(r):r E R} 


is a subset of Z (Q). 


We first show how the elements of 2 can be linearly ordered; then we show how to 
add and multiply its elements. 


a) A linear ordering of 2: Since the elements of J are subsets of Q we will order the 


elements of 2 by inclusion. That is, we define “<” on 2 as follows: 
(or) < (at) S (ar) C (gt) 


Notice that this ordering of 2 is one which rigorously respects the order of their 
leaders in R. That is (gr) < (gt) = r < t. For example, since —2 < 7, then 


(72) C (o7) and so (9—2) < Gam) 


Furthermore, (g4) = (9) if and only if a = b. If (_ ge) and (9d) are distinct 
elements of 2, then c and d are distinct real numbers and so either c < d or 
d < c. Hence, either (gc) C (94) or (gd) C (g€). So “<” linearly orders 
the elements of 2. 


Addition on 2: We define addition in J as follows: 
(or) a (ot) = {x +yireE (ar) and YE (ght 


Note that addition of the elements of 2 rigorously respects the addition of its 


leaders. For example (—9—5) + (497) = (49-5 + 7) = (2). 


Multiplication on Y: In the case where both r and t are greater than zero we 
define multiplication as: 


(Kor) (Kot) = {zy : x E (Kor) and y E (Kt), z, y > O} U [(—co, 0) N Q] 


The more general definition of multiplication on Y, which includes products of 
negative numbers, is a bit more complicated. 


In general: 
(99) for the case where r or t is 0. 
(ag |r|) (o |t|) for the case where r and t are both positive 
(or) (x98) = or both negative. 
=o |r|) (ltl) for the case where precisely one of r or t 


is negative. 
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Note that multiplication of the elements of 2 rigorously respects the multiplica- 
tion of its leaders. For example: 


(95) (97) = (H95 x 7) = (935) 
(o 2) (a-v 2) = -(- V2 x v2) = =(e52) 
(~9—4) (— 99) = (+99) 

(2) (eg-1) Skt. Sag) 


Basic addition and multiplication properties. 


The definitions of addition and multiplication are such that the following fundamental 
properties are all satisfied: 


(o0) + (90) = (0 +a 
(0) (00) = (99 x a 
(074) + (490) = (7a +a 





aae Mice Mel Nee Malt 
I 
NNN 
T 
© 
© 


We see that: 


— (90) plays the role of the additive identity in 2 

— (90) plays the role of the multiplicative zero element in 2 
— every element in 2 has an additive inverse 

— (91) plays the role of the multiplicative identity in 2 

— every non-zero element of 2 has a multiplicative inverse 


We also see that this subset 2 C Y(Q), when linearly ordered by inclusion, mimics 
in every way the behaviour of the real numbers. 


Defining a one-to-one onto function from R onto 2. 


The function f : R — 2 defined as 


f(r) = (eer) 


is a natural one-to-one mapping which copies R into A(Q) in the form of 2 = { (—9r) : 
reéR}. 


This function, f, respects the linear ordering, addition, multiplication in R: 
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r<t  f(r)=( 
fett) = (arti) = 
fr xt)= (rxt) = 


r) C (cet) = f) 
)= f(r) 
f 


—Q 
ar) t (gt r 
ag!) X (Hot) = (r) 


e E e 


We first provide an informal definition of “Dedekind cuts” based on an understanding 
of initial segments of rational numbers as described above. 


Definition 17.2 Informal definition. The elements of the set 2 = {(— gr) : r € R} where 
(or) = (œ%,r)N Q, are called Dedekind cuts. 





17.3 Defining the real numbers R as Dedekind cuts. 


We now have the background and the ingredients needed to provide a set-theoretic 
definition of the real numbers. 


Definition 17.3 The set of all Dedekind cuts 2, linearly ordered by inclusion with addition 
+ and multiplication x (as described above) is called the real numbers. Those Dedekind 
cuts which have no least upper bound in Q are called irrational numbers. 


“Those Dedekind cuts which have no least upper bound in Q” means those Dedekind 
cuts whose leader is a real non-rational number. Now defining the set of real num- 
bers while referring to real numbers (in the definition of Dedekind cuts) is of course 
inappropriate. To make things right, we will go back to our informal definition of 
Dedekind cuts and remove any reference to real numbers. That is, we rewrite this 
definition (without altering its meaning) so that it assumes no knowledge of the set 
of real numbers on the part of the reader.! 


Definition 17.4 A Dedekind cut is a subset © of the rational numbers Q which satisfies 
the following properties: 


1. The set SAPandSFQ. 


1Kronecker, a very influential leading mathematician of his time was quoted as saying to the mathemati- 
cian Lindmann: “Of what use is your beautiful investigation of 7. Why study such problems when irrational 
numbers do not exist.” 
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2. For any two rational numbers a and b , ifa € S and b <a, then b € S. 


3. The set S contains no maximal element. 


This definition describes precisely those sets which can be expressed in the form 
(—oo,r) 1 Q = (or) for some real number r. To help us see this we make the 
following comments: 


— By condition 1, S is a non-empty proper subset of Q and so there exists a rational 
element q which does not belong to S. 


— Ifa € S, then a < q since, by condition 2, q < a would imply q € S. Also, by 
condition 2, (co, a] C S. Since S is bounded, then by the completeness property 
of the real numbers, S has a least upper bound, say r. 


— By condition 3, r cannot belong to S. Then S = (—o0,r) NQ = (qr). 
We conclude that: 


“Any set which satisfies the three conditions in the formal definition of a 
Dedekind cut is of the form (gr) for some real number r.” 


Proving that 
“Any set of the form (7) for some real number r is a Dedekind cut.” 
is left as an exercise. 


We conclude that the set 2 of all Dedekind cuts can conveniently be expressed as 
P ={(Kor): 7 E R} where (_gr) = (co, r) NQ 


as stated in the informal definition 17.2. 


So if (97) is a Dedekind cut with a rational number as leader, then this Dedekind 
cut can be referred to as a rational number. But if those with eyes “for-rational- 
numbers-only” cannot perceive a leader for (7), only seeing a ray of rationals lining 
up towards negative infinity with nothing at its head, then this Dedekind cut is an 
irrational number. 


We present a few examples of Dedekind cuts: 


(92/3) — {x € Q PE A 2/3} An initial segment of Q with leader 2/3. This is a rational number. 
(9-150) = {x € Q £ < —150} An initial segment of Q with leader —150. This is a rational number. 


(a772) 


{x € Q AE — v2} An initial segment of Q with no rational number as a leader. 
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17.4 Completeness property of the real numbers R. 


We will now verify if the set of all Dedekind cuts satisfies the completeness property 
(or as it is often called, the Least upper bound property). We remind ourselves what 
this property states: 


“Every bounded subset of R has a least upper bound which is a real num- 
ber.” 1 


The completeness property is one which distinguishes R from other infinite linearly 
ordered sets. If the set 2 does not satisfy this property it disqualifies it from being 
called the “real numbers”. We will show that the set of all Dedekind cuts, linearly 
ordered as described above, passes the test for completeness. We first prove a lemma. 


Lemma 17.5 The union of a non-empty set of Dedekind cuts is either itself a Dedekind 
cut or is the set Q. 


Proof: 


Given: That U is a non-empty set of Dedekind cuts. 

What we are required to show: That U{V : V € U} is Q or is of the form (gr) for some 
reéR. 

Case 1: Suppose U = {(_ gt) : t € R}. Since every rational number q belongs to (qt) 
for some real number t, then Q = User (ot), and we are done. 


Case 2: Suppose that U{V : V € U} # Q. Then there is a proper non-empty subset 
M C R such that U = {(—9t) : t E€ M} and Uem (Hot) # Q. It suffices for us to show 
that Uem (ot) is a Dedekind cut. 


Then there exists some u € Q such that u ¢ (ot) for allt € M. Then t < u for all 
te M. This means that u is an upper bound of M C R. 

By the completeness principle for the real numbers, since M is bounded in R, M has a 
least upper bound, say v € R. 


We claim that Uren (et) = (o0). 


— We first show that (gv) C Useng (Hot): 
Let z € (gv). Then there exists t € M such that z < t < v (for if t < z for all 
t € M, then z is an upper bound of M, a contradiction of the definition of v). So 
z € (gt) EU. Then (gv) © Uren (gt) must hold true. 


1Note that this property is often expressed in many different but equivalent forms. The following prop- 
erties are all equivalent to the Completeness property: 1) The limit of every infinite decimal sequence is a 
real number, 2) Every bounded monotonic sequence is convergent, 3) A sequence is convergent if and only 
if it is a Cauchy Sequence. Googling the words “Completeness property” may direct the internet surfer to 
any one of these. 
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— We now show that Users (ot) E (Hv): 


Let u € (ot) for some t € M. Since t € M, t <v. Then u E€ (gt) C (. gv). So 
Ure (et) E (Av) as claimed. 


So Usem (Hot) = (Kev), a Dedekind cut as claimed. 
So the union |J; m (gt) of all elements of U = {(— gt) : t € M} is a Dedekind cut. 


Theorem 17.6 Let Y denote the set of all Dedekind cuts linearly ordered by C. Then if 
SY is a non-empty bounded subset of 2, Z has a least upper bound (with respect to the 
ordering C). 


Proof: 
We are given that Z is a non-empty bounded subset of 2. Then Z is of the form 
F ={(ot):t eM CR} 


for some proper subset M of R. Since is a bounded subset of Y with respect to C, 
there exists k € R such that (t) C (9) for allt € M. Then its union U = Use yy (Ko!) 
cannot be all of Q. By the lemma, its union U is a Dedekind cut, say, (—,v). 


We claim that U = (—,v) is the least upper bound of .” with respect to C: Since 
Urem (ot) = (oY), then (gt) © (ov) for all t in M and so the Dedekind cut 
(oV) is an upper bound of Z. Suppose (u) is another upper bound of .7. Then 
U = Uem (Hot) E (Kg). So (oV) E (Kg). We must conclude that (_9v) is a least 
upper bound of Z. 


So 2 satisfies the completeness property. 


From this we conclude that the set of all Dedekind cuts, 2, represents the set of all 
real numbers in the ZFC set-theoretic universe. Thus, from the primitive concepts 
“class”, “set” and “belongs to” and the axioms one to eight we have successfully de- 
fined the sets of natural numbers, integers, rational numbers and real numbers. The 
elements of these sets are themselves sets. If the existence of the natural numbers N is 
almost an immediate consequence of the Axiom of infinity, the other axioms provided 
the necessary tools to construct from N the integers, rationals and real numbers (as 
sets). 


Remark: Having defined the real numbers as Dedekind cuts, we see that every real 
number u is a subset of Q and so u € P(Q). Since Q C #P*(N) (see page 150), 
u € P(PË(N)) = Y"(N) and so, R C #"(N). We can now see why the Aziom 
of power set plays an essential role in the ZF'C-universe. Had we not declared that 
“Y(S) is a set whenever S$ is a set” what guarantee would we have that the real 
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numbers exists in our universe. 


We have not yet invoked two axioms: Axiom A9, called the Axiom of regularity and 
the Axiom of choice. These two axioms will help us handle certain difficulties encoun- 
tered while dealing with infinite sets, the main subject of our investigation for the rest 
of this book. 


Concepts review: 


1. 


2x 


10. 


11. 


What is an initial segment? What is its leader? 


How is addition of initial segments defined? 


. If r and t are positive real numbers, how is multiplication of (gr) and (gt) defined? 
. Provide a definition of Dedekind cuts. 

. Define a function f which maps R one-to-one onto the set 2 of all Dedekind cuts. 

. Give a set-theoretic definition of the real numbers. 

. What can we say about the union of a family of Dedekind cuts? 


. What does the Completeness property of the reals (equivalently, Least upper bound 


principle of the real numbers) state? 


. Does the set of all Dedekind cuts satisfy the Completeness property? 


How are the elements of all Dedekind cuts ordered? 


How does a Dedekind cut representing a rational number differ from one representing 
an irrational number? 


EXERCISES 


A. 


1. Perform the following operations on the given Dedekind cuts: 
a) (073) x (91/3) 
b) (41/3) + (495) 
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c) (95) x (490) 
d) (71) + (o1) 


. Show why (~g—1) + (—05) < (491/38) + (910). 

. What is the least upper bound of Uco (—9¢)? 

. Find a Dedekind cut strictly in between the two cuts (_,1/2) and (92/3). 

. Is there a Dedekind cut that contains all Dedekind cuts? If so, what is it? If not, 


why not? 


. How does the set-theoretic definition of R guarantee that it is a set? 
. What link can we make between the set theoretic definition of R and the natural 


numbers N? 


. Show that any set of the form (gr), where r is a real number, satisfies the three 


conditions in the formal definition of a Dedekind cut. 


. Let f : R — Y(Q) be a function defined as follows: f(a) is the least upper bound 


of {y € Q: y < x}. What is the image of R under the function f? 
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18 / Infinite sets versus finite sets. 


18.1 


18.2 


Summary. In this section we give a definition of “infinite set” as put forward 
by Richard Dedekind. We then establish a few of the most basic properties of 
infinite sets and compare them to those of finite sets. These properties allow us 
to characterize finite sets as those sets which are in one-to-one correspondence 
with some natural number, n. We also show that for any finite set S, P(S) is fi- 
nite. Finally, we prove a version of the “Recursively defined function theorem”. 


Infinite sets. 


The notions of “finite set” and “infinite set” are often viewed as being opposites of 
each other, in the sense that if we define one of these, then the other is its negation. 
We all have an intuitive idea of what a finite set is and how it differs from an infinite 
set. Most would agree with the statement “A finite set is a set whose elements you 
can count so as to determine how large or how small it is”. We would of course first 
have to explain what it means to “count” and what it means to determine “how many 
elements there are in a set”. Our definition of “finite” would have to be such that we 
can say “every natural number is finite”. 


The concept of “infinite set” is abstract, purely an idealization of something we find 
useful when discussing certain topics, even though it impossible to perceive. Yet, 
for anyone who studies or uses mathematics in any field, whether it be engineering, 
physics, or social sciences, doing mathematics without referring to “infinity” or “in- 
finite sets” would feel like trying to walk with both shoelaces tied together. When 
the word “infinite” is used in a conversation, only a mathematician might possibly 
ask “What do you mean when you say the word infinite?”. Most individuals might 
embarrassingly respond “I don’t really know, but I think everybody intuitively under- 
stands what we mean when using that word.” Appropriately defining “infinite sets” is 
important, not because it would be an interesting mathematical exercise to do so, but 
because if we want to make sense of the mathematics we do today, we have no choice. 
Experience shows that doing mathematics without clearly defining the concepts we 
are referring to can lead to contradictions or erroneous results. In this text, we have 
informally used the words “finite” and “infinite” before, but never in a mathematical 
statement proven to be true or false. If we want to refer to infinite and finite sets 
in theorem statements or definitions these must be precisely defined. We choose to 
provide a definition of “infinite set” and then define “finite set” as being one which 
is not infinite. 


Dedekind’s definition of infinite set. 


We start with the definition of an infinite set as evoked by the mathematician Richard 
Dedekind (1831 - 1916). 
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Definition 18.1 A set S is said to be an infinite set if there exists a one-to-one function 
mapping S onto a proper subset of itself. If a set S is not infinite, then we say that it is a 
finite set. 


18.3 


18.4 


Surprised? Well, it is better than saying that an infinite set is “a set with lots of 
things in it”. (A set containing a billion pencils has a lot of things in it and yet no 
one would describe it as being an infinite set.) Dedekind’s definition is succinct and 
without ambiguities since it is only expressed using words that have been previously 
defined. An infinite set is a set which properly contains a one-to-one image of itself. 
If we were to define finite sets as being those sets that are not infinite, then we could 
say that “a set S is finite if and only if S contains no proper subset T which is a one- 
to-one image of itself”. For example, since the natural number 6 = {0,1, 2,3, 4,5} 
cannot be in one-to-one correspondence with any of its elements, it cannot be infinite. 
When a function f : A —> B maps a set A one-to-one into a set B we often say that f 
embeds A inside B in the sense that B contains a “copy” of A. Using this vocabulary 
we can say that “S is infinite if and only if it is embedded into a proper subset of itself”. 


Example: N is an infinite set. 


The set, N, of all natural numbers was defined as being the smallest inductive set. We 
test our definition of “infinite set” on N. The set N is infinite only if we can produce 
a function f : N — N which embeds N into a proper subset of itself. Consider the 
function f : N — N defined as f(n) = 2n. We see that f is one-to-one and that the 
image, f[N] = {0,2,4,6,...,}, of N under f is a proper subset of N. We have proven 
that: 


“The natural numbers N is an infinite set.” 


Properties of infinite sets and finite sets. 


The following theorems confirm that infinite and finite sets satisfy the properties we 
expect from them. Ultimately, we want to show that the finite sets are precisely those 
sets which are in one-to-one correspondence with some natural number. Surprisingly, 
this does not follow immediately from our definitions of infinite and finite sets. 


Theorem 18.2 Basic properties of infinite and finite sets. 


1 Actually records show that Bolzano suggested in 1847, (before Dedekind) that an infinite set was a set 
that could be mapped one-to-one onto a proper subset of itself, a property was is not satisfied by finite 
subsets. 
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a 
b 
c 


) 
) 
) 
d) 


The empty set is a finite set. 
Any singleton set is a finite set. 
A set which has a subset which is infinite must itself be infinite. 


A subset of a finite set must be finite. 


Proof: 


a) 


= 


The empty set Ø has no proper subsets and so a function f cannot map Ø into a 
proper subset of @. So @ is finite. 


The singleton set, {x}, contains only one element x. Since x is not a proper sub- 
set of x, the only proper subset of {x} is @.1 Then, for any well-defined function 
f : {x} — {x}, Ø cannot be the one-to-one image of {x} under f. So singleton sets 
are finite. 


What we are given: That X is an infinite subset of a set S. 

What we are required to show: That S is infinite. 

If X = S, then S is infinite and we are done. Suppose X # S. Since X is infinite, 
then there exists a one-to-one function f which maps X onto f[X] c X. Since 
f[X] C X C S, the set S is the pairwise disjoint union of the three sets S — X, 
X — f|X] and f[X]. Define the map g : S > (S — X) U f[X] as follows: 


x if crzES-xX 
g(x) = 


f(x) if tex 


Then g maps S one-to-one onto (S — X)U f[X], a proper subset of S. So, by defini- 
tion, S is an infinite set. 


Suppose F is a finite set and X C F. We are required to show that X is finite. 
Suppose X is an infinite set. Then by the statement in part c) F must be infinite, 
contradicting our hypothesis. So X must be finite. 


Theorem 18.3 Let f : X — Y bea one-to-one function mapping X onto Y. The set X is 
infinite if and only if the set Y is infinite. 


Proof: 


(= 


) What we are given: That X is an infinite set and that f : X — Y is a one-to-one 


function mapping X onto a set Y. 
What we are required to show: That Y is infinite. 
Since X is infinite, by definition, there exists a function, g : X — g|X], mapping X 


1Even in the odd case where x € x, x can not be said to be a proper subset of x since x = x. 
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one-to-one onto a proper subset g[X] of X. Then f|, x} : g[X] > Y maps the proper 
subset g[X] of X one-to-one into Y. Since f maps X one-to-one onto Y, it has an inverse 
f! mapping Y one-to-one and onto X. Then the function 


[(floxpegef 7]: Y >Y 
maps Y one-to-one onto f|aixlg|X]] = flg[X]], a proper subset of f[X] = Y. So, by 
definition, Y is an infinite set. 


(<) Suppose Y is infinite. Then, since f~! : Y — X is a one-to-one map from Y onto 
X, by the first part of this proof, X is infinite. 


Corollary 18.4 The one-to-one image of a finite set is finite. 


Proof: The proof is left as an exercise. 


Lemma 18.5 If S is an infinite set and a € S, then S — {a} is an infinite set. 
Proof: 


What we are given: That S' be an infinite set and a € S. 
What we are required to show: That S — {a} is an infinite set. 
Since S is infinite, then there exists a one-to-one function g : S — S such that g[S] c S. 
We will show that S—{a} is infinite by exhibiting a one-to-one function h on S—{a} such 
that h|S — {a}] is a proper subset of S — {a}. Choose an arbitrary element k € S—g[S]. 
— Case A: Suppose a € g[S]. Then there is some u € S$ such that such that g(u) = a. 

- Subcase A-1: Suppose u # a. Define a function h on S — {a} as follows: 


_f gz) if xresS-—{a, u} 
ao) { k if x=u 
Since g is one-to-one on S — {a,u}, then so is h. Furthermore, h uniquely maps u 
to k € S —g[S]. So h is one-to-one on S — {a}. See that neither of the elements 
g(u) and a belong to h[S — {a}]. So h|S — {a}] is a proper subset of S — {a}. 


- Subcase A-2: Suppose u = a. Choose an element v Æ a in g[S]. Define a function 
h on S — {a} as follows: 


Boeda if xes- {a,v} 


k if gz=v 


Since g is one-to-one on S — {a,v}, then so is h. Furthermore, h uniquely maps v 
to k in S — g[S]. So h is one-to-one on S — {a}. Also see that neither g(v) nor a 
belong to h|S — {a}]. So h[.S — {a}] is a proper subset of S — {a}. 
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— Case B: Suppose a € S — g[S]. Define a function h on S — {a} as follows: 
h(x) = g|s—ta}(x) for all x € S — {a} 


Since no other element in S — {a} is mapped to g(a) by g, neither g(a) nor a belong 
to h[S — {a}]. So h|S — {a}] is a proper subset of S — {a}. 
We conclude that S — {a} is infinite. 


Theorem 18.6 Every natural number n is a finite set. 
Proof: 


The proof is by induction. Let P(n) be the property “The natural number n is fi- 
nite”. Since 0 = Ø is finite, then P(0) holds true. Suppose the natural number 
n = {0,1,2,3,...,n — 1} is finite. We claim that n+ 1 = nt = {0,1,2,3,...,n} 
must be finite. Suppose not. That is, suppose n* is infinite. By the lemma 18.5, 
(n +1) — {n} = n must also be infinite contradicting the fact that P(n) holds true. So 
P(n+1) must hold true. By the principle of mathematical induction, P(n) holds true 
for all natural numbers n. Thus, every natural number is a finite set. 


In the proof of the following corollary we require the Axiom of choice to justify a 
particular step. This is the first time we invoke this axiom. We will discuss the axiom 
of choice in length later on. For now we will simply state it and point out the step 
where it is invoked: 


Axiom of choice: For every set « of non-empty sets there is a function 
f which associates to every set A in æ an element a € A. 


At first, it seems rather harmless enough. It says that if we have a set of non-empty 
sets, then we can choose from each set one element. If the set of sets has only finitely 
many sets, then the Axiom of choice is not required. The sticky point is the one en- 
countered when the set contains infinitely many sets. If a theorem statement invokes 
the Axiom of choice in its proof, it is common practice to alert the reader to this fact 
by posting the acronym [AC].! 


Corollary 18.7 [AC] A set S is finite if and only if S is empty or it is in one-to-one cor- 
respondence with some natural number n. 


Proof: 


1The reader may see the posting of this acronym as a challenge by the author asking: “Is it possible to 
prove this statement without invoking the Axiom of choice?” . 
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(<=) Suppose S is empty or is the one-to-one image of a natural number n. Since Ø is 
finite and every natural number n is finite, then S must be finite (by Corollary 18.4 and 
theorem 18.6). 


(=) Conversely, suppose S is a non-empty finite set. 

We are required to show that there exists a natural number n which can be mapped 
one-to-one onto S. 

Suppose not. That is, suppose there does not exist a natural number n which maps one- 
to-one onto S. We claim that S must then be infinite, contradicting our hypothesis. 


Proof of claim: We prove the claim by constructing a one-to-one function f : N > S 
which maps N into S. 


— Choose an element so in S to form the subset Sı = {so} of S. Define the function 
f : {0} — {so} as f(0) = so. Then S — Sı is non-empty, for if it was empty, then 
S = Sı would be the one-to-one image of {0} under the function f contradicting the 
fact that S is not the one-to-one image of a natural number. So we can choose an 
element sı from S — Sı to construct the subset S2 = {sọ, s1}. Define the one-to-one 
function f : {0,1} — {so, sı} as f(i) = s; for i = 1, 2. 

— Suppose we have inductively constructed the subset Sn = {50, $1, $2,---,;Sn—1} of S 
where f : {0,1,...,2—1}— Sn is the one-to-one function defined as f(z) = s;. Then 
to avoid a contradiction, S — Sn must be non-empty. The Axiom of choice provides us 
with the choice function k : A(S) — S which allows us to choose from each set S — Sn 
an element sn from which we define the one-to-one function f : {0,1,...,n} —> Sn+1 
defined as f(i) = s; ifi <n and f(n) =k(S — Sn) = sn. We can in this way “induc- 
tively” construct a one-to-one function f : N — S mapping N into S. Then S contains 
a one-to-one image f[N] = {sọ0, 51, 52, 53,...} of N. By part c) of theorem 18.2, S must 
be infinite. This contradicts the part of our hypothesis in which S was declared to be 
finite. The source of this contradiction is the statement “there does not exist a natural 
number n which maps one-to-one onto S.” So any finite subset is the one-to-one image 
of some natural number. 


A few words on the “inductively” constructed function in the above proof. In the 
proof of the corollary, we have constructed a one-to-one function f : N — S by 
defining f(z) for one number i at a time. For each n, the value of f(n) depends on 
the values of f(z) for each i < n. This is because the choice function k assigns to the 
set S—{s9, $1,.--, Sn—1} an element sn. The value of f(n) is then set to be equal to sn. 


We conveniently declared that an “inductively constructed function f” is produced 
in this way, as if it was clear that this process will automatically produce well-defined 
functions. Even though it seems like a reasonably safe method for constructing func- 
tions, we should not be blind to an element of uncertainty involved in this process. 
It does not immediately follow from the definition of a function that this method for 
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constructing functions will always produce a function. If one asserts that this is obvi- 
ous, then why not produce a proof that shows us how “obvious” it really is. We will 
immediately state the theorem which guarantees that functions constructed in this 
way are valid but defer its proof to the end of this section to avoid digressing from 
our discussion of finite and infinite sets. 


Theorem 18.8 The recursive function theorem. Let S be a set. Let k: A(S) > S bea 
function on Y(S) and f C Nx S be a relation. We write f(n) = a if and only if (n,a) € f. 
Let m € S. Suppose the relation, f, satisfies the two properties 


{ f(0) =m = (0,m) = (0, f(0)) € f 
(n, f(n Ef => (n+1,k(S—{f(0), f0). f} = (+1, fln+1)) € f 


Then f is a well-defined function on N. 


Proof: The proof appears at the end of this section. 


We have shown that “counting” the elements in a finite set S comes down to de- 
termining which natural number n is mapped one-to-one onto S. We are essentially 
assigning to each of the n elements of S the labels 0,1, 2,3,...,2— 1. The corollary 
above shows that we could have defined finite sets as follows: 


Definition: A set S is a finite set if and only if it can be mapped one-to-one 
onto some natural number n. If we say that 


“the finite set S contains n elements” 


we mean that S is the one-to-one image of the natural number n. So 
“S is a finite set” and “S contains n elements for some n” are equivalent 
expressions. A set is an infinite set if it is not the one-to-one image of some 
natural number. 


All the definitions and theorems stated and proved above would logically follow from 
this definition of finite sets. The following theorem provides another characterization 
of infinite sets. 


Theorem 18.9 [AC] A set S is an infinite set if and only if it contains a one-to-one image 
of the set of natural numbers N. 


Proof: 
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(<= ) If S contains a subset U which is a one-to-one image of N, then U is infinite (by 
theorem 18.3) and so S is infinite (by theorem 18.2). 


(=) The proof is left as an exercise. (The proof mimics the proof of Corollary 18.7.) 


Theorem 18.10 [AC] If the set S is a finite set and f : S — X is a function, then f|S] is 
finite!. 


Proof: 


Suppose the set S is a finite set and f : S — X is a function mapping S into some 
set X. We are required to show that f[S] is a finite set. Suppose f[S] is an infi- 
nite set. Then there exists a function g : N — f[S] mapping N into f[S] (18.9). Say 
g(N] = {9(0), 9(1), 9(2), 9(3),...} C f[S]. Then for each i € N, f(g(2)) is a non-empty 
subset of S. For each i € N we can choose an element s; from f“ (g(i)) (the Axiom of 
choice allows us to choose an infinite number of elements in this way). So S contains 
a subset {51, $2, 53,...} of distinct elements. Let h : N — S be the function defined as 
h(i) = si. This set is infinite since it is a one-to-one image of N under the function h. 
Since S' contains an infinite subset, then it must be infinite (by 18.2). A contradiction! 
So f[S] is a finite set. 


Theorem 18.11 If a set S contains n elements, then A(S) contains 2” elements”. Hence, 
if a set S is a finite set, then the set Y(S) is finite. 


Proof: 


The proof is by induction. 


For a natural number n, let Sn denote a subset of S which contains n elements. That is, 
So = Ø, Sı = {s0}, So = {80, 51},..-, Sn = {50, $1,---,5n—1}. For each natural number n 
let P(n) denote the statement 


“The power set P (Sn) contains 2” elements” 
Base case: If n = 0, then So = Ø and so A(So) = {Ø} contains 1 = 2° element so P(0) 
holds true. 
Inductive hypothesis: Suppose P(n) holds true. That is, suppose that for any set of n 


elements the set A(S,,) contains 2” elements. Let Sp41 = {50, 51, 52,---,5n} be a set 


1Note that f need not be a one-to-one function for this to hold true. 


2If n = 0 we define 2° = 1,2! = 2° x 2. If nm is a natural number other than 0 we define 2” = 
2X2X: -x2 
—_— k 


n times 
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containing n + 1 distinct elements. Then if Sn = {s9, $1, $2,...,Sn—1}, by the inductive 
hypothesis, we can express 2 (Sn) 


P(Sn) = {Up, Ui, Uo, Be st Ugn_1} 


where the U;’s represent all distinct subsets of Sn; we will suppose that Up = Ø and Ugn_1 = 
Sn. Since the elements of the set S,,41 are distinct, then sn Z Sn, and so {sn} Z A(S,). 
Then Y(S,) C A(Sn41). For each i = 0 to 2” — 1 define V; = U; U {sn}. We see that 
P (Sn) N {Vo, Vi,..-, Varn_i1} = Ø since every V; contains the element sn. Furthermore, 
every element in A(S;,41) is accounted for in A(S,,) U {Vo, Vi,..., Vorn_i}. Then 





P Sia) = {U0, U1, U2,- - -, U2n—1, Vo, Vi, Voy -, V2n—1} 


We see that A(S,41) contains 2” x 2 = 2+1 elements. So P(n + 1) holds true. 
By the principle of mathematical induction, P(n) holds true for all natural numbers n. We 


conclude that for any set S which contains n elements, the set Z (S) contains 2” elements. 
Thus, if S is finite, then so is A(S). 


18.5 Proof of the recursive function theorem. 


A recursively defined function f : N — S (on N) is a function which is defined one 
term at a time. The process begins by defining f(0) at 0. Then, for each n > 0, the 
value of f(n) is determined based on the values previously assigned to each of f(0), 
f(), f(2), ..., f(n — 1). The theorem explicitly states the conditions under which 
this method of defining a function is valid. 


The recursive function theorem: Let S be a set. Let k: P(S) — S be a function 
mapping subsets of S to elements of S and f C N x S be a relation. We write 
“f(n) =a” if and only if (n,a) € f. Let m € S. Suppose the relation f satisfies the 
two properties 


(0, m) = (0, f(0)) € 


f 
(n +1, (S= {£(0), FO), <- #(n)}) 
=(n+1,f(n+1)) Ef 


=m => 
(n, fin) Eef => 
Then f is a well-defined function on N. 
Proof: 


Let Z be a class of relations R in N x S which contain (0, f(0)) = (0, m) and satisfy 
the condition: 


{(n, f0) :i<n}j ER => (n+1,k(S—{f(0), fQ),.--,f(m)}) 
= (n+ 1,f(n+1)) ER 
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Now Z is non-empty since it contains N x S. Let f* = prey R. This means that 
f* is the smallest set of ordered pairs satisfying the conditions described for Z. The 
relation f* looks something like 


F% = {(0, F(O)), (1, k(S — {F(0)})), (2, OS — {F(0), FD) + 
We claim that f* is a function mapping N into S. 


Proof of claim: We first establish that dom f* = N. We first note that 0 € dom f*. If 
n € dom f*, then (n, f(n)) € f*. This implies 


(n+1,k(S—{f(0), F0)... fh) =(n+1fn+ Def 


and so n+ 1 € dom f*. Hence, by induction, the domain of f* is all of N. We now 

proceed to the proof of the claim. 

The proof of the claim is by the second version of mathematical induction. Let P(n) 

denote the statement “[(n, x) € f* and (n,y) € f*] => [x = yl”. 

— Inductive hypothesis: Suppose P(m) holds true for all natural numbers m < n. 
That is, (m, x) € f* and (m,y) € f* implies x = y. We will show that given our 
hypothesis, P(n) must hold true. 

* Suppose not. Suppose (n,x) € r and (n,y) € f* where x 4 y. Let U = 
f* —{(n, y)} (the set f* take away the element (n, y)). Then we easily see that 
U is one of the relations in Z. The fact that U is strictly smaller than f*, 
previously declared to be the smallest of the relations in Z, is a contradiction. 
Then x must be equal to y. We conclude that P(n) holds true as required. 

By mathematical induction, P(n) holds true for all n. So f* is a well-defined function 

as claimed. 

We will now show that f* is unique. Let g be another function satisfying the conditions 

(0, m) = (0, 9(0)) € g and 


{(n,g(i)):i < n} Cf" > (n+1, k(S— {g(0), g(1),...,g(n)}) = (n+1,g(n+1)) € f* 


We prove uniqueness by induction: Let P(n) denote the statement “f*(n) = g(n)”. 


— Inductive hypothesis: Suppose P(m) holds true for allm < n. That is, f*(m) = g(m) 


for allm < n. 
Then by definition of f*, 


(n, k(S— {f(0), f0), fín- 1)})) = (n, f(n) 


(n, k(S' — {g(0), (1), .--,g(m— 1)})) 
(n, g(n)) 


Hence, P(n) holds true. 
By mathematical induction f*(n) = g(n) for all n, and so f* is unique as claimed. 
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We conclude that the function f = f* is a function which is uniquely defined by the 
given conditions. 


Concepts review: 


1. 


2. 


10. 


11. 


12. 


13. 


14. 


15. 
16. 


17. 
18. 


What is the definition of infinite set as put forward by Dedekind? 


From Dedekind’s definition of infinite set how can we show that N is infinite? 


. How do we define a finite set? 

. Is the empty set a finite set? 

. Is a subset of a finite set finite? 

. If S is an infinite set and u € S, must S — {u} be infinite? 

. If a set S has a subset which is infinite is S necessarily infinite? 


. If a set S is infinite and f : S — Y is a one-to-one mapping onto a set Y, what can 


we say about the set Y? 


. What can we say about one-to-one images of finite sets? 


If a set S is infinite and we remove two elements from this set is the resulting set 
necessarily infinite? 


We know that natural numbers are sets. Are all natural numbers necessarily finite 
sets? 


Statement: “Any finite set S is necessarily the one-to-one image of some natural 
number.” Is this statement true or false? 


Statement: “An infinite set is the one-to-one image of the natural numbers.” Is this 
statement true or false? 


Statement: “An infinite set necessarily contains a subset which is a one-to-one image 
of the natural numbers.” Is this statement true or false? 


What can we say about the image (not necessarily one-to-one) of a finite set? 


Is saying “S is a finite set” equivalent to saying “S contains n elements” where n is a 
suitable natural number? 


If S is a finite set, is it necessarily true that A(S) is a finite set? 


If S contains 6 elements how many elements does Y(S) contain? 
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EXERCISES 
1. Prove that the one-to-one image of a finite set is finite. 
2. Prove that: 
a) Z is an infinite set. 
b) Q is an infinite set. 
c) R is an infinite set. 
3. Prove that if S is infinite, then S x S is infinite. 
4. Prove that if S is finite, then S x S is finite. 
5. Prove that if S and T are infinite, then SU T is infinite. 
6. Prove that the union of two finite sets is finite. 
7. Prove that the union of a finite set of sets is finite. (You may use the result in 
question 6 combined with a proof by induction on the number of finite sets.) 
8. Prove that if S UT is infinite, then either S or T is infinite. 
9. Prove that if the set A is infinite and B is any set, then the set A x B must be 
infinite. 
10. Prove that if F is a finite subset of an infinite set S, then S — F is infinite. 
11. Prove that if a set S is an infinite set, then it contains a one-to-one image of the set 


of natural numbers N. 
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19 / Countable and uncountable sets. 


19.1 


19.2 


Summary. In this section we define the words “equipotent sets”, “countable 
sets” and “uncountable sets”. We show that Q and Z are countable since we 
can produce a one-to-one correspondence between each of these and N. We also 
show that Nx Z and NxN are countable. Countable unions of countable sets are 
shown to be countable. We also show that no such one-to-one correspondence 
can exist between R and N and so R is uncountable. 


Can we compare infinite sets as we do finite sets? 


“Finite sets are precisely those sets which are in one-to-one correspondence with some 
natural number n” is one of the simplest and most intuitive characterizations of finite 
sets. It essentially allows us to characterize sets according to their size. For example, 
suppose the two sets Sp and Sm can be mapped one-to-one and onto the natural 
numbers n and m, respectively. We can declare Sm to be larger than Sn if and only 
ifn C m. We can declare them to be the same size if and only if n and m are the 
same natural number. Note that declaring two sets S and T to be the same size does 
not mean that they are equal. It simply states that they are both one-to-one images 
of the same natural number. 


Can the described method for comparing finite sets be used to compare infinite sets? 
One might ask why we would want to compare infinite sets in this way and argue that 
“infinite sets need no comparing since, intuitively, they are all as big as a set can be”. 
But our intuition is not always reliable, particularly when referring to infinite sets as 
they are defined in the ZF'C-universe of sets. Verifying whether the infinite sets N, Z, 
Q and R are one-to-one images of each other is a question worth investigating. 


Equipotent sets. 


One of the set theory axioms states that two sets are equal provided they contain 
the same elements. We also say that two finite sets A and B (not necessarily equal) 
which are both one-to-one images of the same natural number n are the same size, 
or contain the same number of elements. If two sets A and B contain n elements it 
of course follows that these two sets are one-to-one images of each other. Any two 
infinite sets S and T can also be one-to-one images of the other. The set N and the 
set of even natural numbers is an example. We introduce a term used to describe this 
relation between two sets. 


Definition 19.1 Two sets, A and B, are said to be equipotent sets if there exists a one-to- 
one function, f : A — B, mapping A onto B. If A and B are equipotent we will say that 
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“A is equipotent to B” or “A is equipotent with B”. 


19.3 


So equipotent finite sets are precisely those finite sets which are equipotent to the same 
natural number. We know that for finite sets A and B, “A is equipotent to a proper 
subset of B” and “A is smaller than B” are equivalent statements; we cannot however 
say that an infinite set A which is equipotent to to a proper subset of a set B is neces- 
sarily “smaller” than B. For example, even if the set N is easily seen to be equipotent 
to the proper subset, {0,4,8,12,...,}, of itself (via the function f(n) = 4n), we 
instinctively hesitate to say that N is a “smaller” set than {0,2,4,6,...,} or that 
{0, 2,4,6,...,} is smaller than N. The words “smaller than” seem to have a precise 
meaning only when discussing finite sets. 


Countable sets. 


We will encounter many kinds of infinite sets. It will be helpful if we can categorize 
infinite sets into subfamilies of equipotent sets. We will be particularly interested in 
those infinite sets which are equipotent with N. Such sets are said to be countably 
infinite sets. 


Definition 19.2 Countable sets are those sets that are either finite or equipotent to N. 
Infinite countable sets are said to be countably infinite. Those infinite sets which are not 
countable are called uncountable sets. 


So the adjective “countable” means “to be a one-to-one image of some subset of N”. 
Does it make sense to speak of a proper class of elements which is “countable”? The 
Axiom of replacement (A7) guarantees that all “countable classes” are sets. To see 
this we recall the Axiom for replacement. 


Axiom of replacement: Let A be a set. Let (x,y) be a formula which 
associates to each element, x, of A an element y in such a way that whenever 
both (x,y) and ġ(x, z) hold true, y = z. Then there exists a set B which 
contains all elements y such that (a, y) holds true for some x € A.t 


This axiom dictates that if A is a set and B is a non-empty class of elements and 
f C Ax B isa relation which satisfies the property “(z,y) E€ f and (a, z) € f, then 
y = 2”, then there exists a set C which contains the elements, f(x), for all x € A. 
This axiom guarantees that if A is a set and f[A] contains only elements, then the 


1This axiom is more often expressed as the Replacement axiom schema since it is in fact many axioms 
each differing only by the formula ¢ it refers to. So to be more precise, given a formula ¢ in set theory 
language, we would refer to it as axiom A7(¢) rather than A7. 
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functional image, f|A] = {x : f(a) = x for some a € A}, is a set. So any class which 
is a one-to-one image of a subset of N must be a set. 


Our definition of countable sets does not guarantee that uncountable sets exist. We 
simply stated that those sets which are not countable (if any exist) will be called “un- 
countable sets”. We will carefully study the sets we have encountered to this point 
and determine which ones are countable and which ones are not. Recall (from theorem 
18.2 a) ) that the empty-set, Ø, is finite and so, by definition, is countable. Of course, 
the set, N, of all natural numbers is equipotent to itself and so is countable. Since no 
element of N is infinite every natural number is countable. We now investigate the 
set of all integers. 


Example: Show that the set Z of all integers is countable. 


Solution: Define the function f : N — Z as follows: 


-2H if n is odd 
5 if n is even 


fin) =f 


It is easily verified that f maps N one-to-one onto Z. For example, 


PO) = 0 
faji =al 
f2) = 1 
fB) = -2 
fa) = 2 


So Z is an infinite countable sett. 


There are quite a few general statements that we can state about countable sets. We 
will find it very useful to know that subsets of countable sets and images of countable 
sets are countable. 


Theorem 19.3 A subset of a countable set is countable. 

Proof: 
What we are given: That S is a countable set and T is a subset of S. 
What we are required to show: That T is countable. 


Case 1: If S is finite, then by theorem 18.2 d), every subset of S' is finite and so T is 
countable; we are done. 


1The reader should note that this is just one of many ways of proving that Z is countable. 
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Case 2: Suppose both S and T are infinite. 


Since S$ is countable it is a one-to-one image, say h|N], of N. Then the function h : 
N — S can be expressed as h(i) = x; € S. We can then express S as a sequence 
S= PROM ty Calin ee Eney 
We are required to show that T is countable. This is done by constructing a function 
f:N— T mapping N one-to-one onto T as follows: 
— Let g: A(N) —N be the function on A(N) which maps any subset of N to its least 
element. Since N has been shown to be well-ordered the function g is well-defined. 


- We recursively define the function m : N — N as follows: 


š 

= 
Š 

2 
| 


g({ieN: a; € TY) 
m(k) = g({i EN:n eT- {£m(0); Em(1) o Em(k-1))) 


By theorem 18.8, m : N —> N is a well-defined function. Since T is infinite, the 
domain of m is N. Furthermore m is a one-to-one strictly increasing function. 


- Note that {£m :i E N} CT. We claim: That {£ma) : i € N} = T: Suppose 
zk ET. Let U = {m(i) € N: m(i) < k}. Then 


gi EN: a, ET -— {Ima :m(i) € U}}) =k 
So £k € {Xmci) : i E N}. We conclude that {£ma) : i € N} =T as claimed. 


— Define the function f : N > T as f(t) = Emi). Since f is one-to-one and onto T, 
then T is countable. 


19.4 More examples of infinite countable sets. 


Showing that an infinite set is countable can be a challenge since it requires con- 
structing a function mapping N one-to-one onto a set. The above result stating that 
“subsets of countable sets are countable” is a useful tool for showing that some infi- 
nite sets are countable without specifically exhibiting a function which maps N onto it. 


We provide examples of other sets which are countable. 
Example 1. Show that the set N x Z is countable. 


Solution: Define the function f : N x Z —> Z — {0} as follows: 
f(m, n) = 2” (2n — 1) 


We claim that the function f is onto Z — {0}: 
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— Let z be any non-zero integer. Then we can factor at most a finite number 
of 2’s, say m 2’s for some integer m (m possibly equal to 0), leaving behind 
a single (either positive or negative) odd factor 2n — 1 for some integer n. So 
z = 2™(2n — 1). Thus, there exists an ordered pair (m,n) € N x Z which is 
mapped to z. So f is onto as claimed. 


We claim the function f is one-to-one: 


— Suppose x = y in Z— {0}. Since z and y are integers there exists natural numbers 
m and n and integers s and t such that! 


x = f(m, s) = 2" (2s — 1) = 2” (2t — 1) = f(n,t)= y 
Suppose, without loss of generality, that m > n, then 


2™(25—1)=2"(2t-1) > 2(2s—1)=2w4-—1 


=> m =n and 2s — 1 = 2t — 1 (RHS = odd > m -n = 0) 





=> s=t 
=> (m,s)=(n,t) 


So f is one-to-one as claimed. 
We have shown that f is both one-to-one and onto. Thus, the sets N x Z and Z — {0} 
are equipotent. We have shown that Z is countable. Since Z — {0} C Z, then, by the 
previous theorem, Z — {0} is also countable. So there exists a one-to-one function g 
mapping Z — {0} onto N. So g~!.f~! maps N one-to-one onto N x Z. So N x Z is 
countable, as required. 


Example 2. Show that the set of rational numbers, Q, is countable. 


Solution: We will represent all rational numbers in the form 
Q={m/n:meEN,ne€ Z— {0}, m/n is irreducible. } 


The rational numbers in this set are irreducible so a/b = c/d if and only if a = c and 
b = d. We define the function f : N x (Z— {0}) — Q as follows: 


f(m,n) = m/n 


This function is clearly one-to-one and onto and so N x (Z—{0}) and Q are equipotent. 
Since N x (Z—{0}) c Nx Z and N x Z was shown to be countable, then N x (Z— {0}) 
is countable. Thus, Q is countable, as claimed. 


‘Note that any non-zero integer can be expressed as a product 2™(2n — 1) for some natural numbers m 
and n. For example, suppose we are given the integer 1584. If we factor out as many 2’s as possible from 
1584 we obtain 2* and we are left with an odd number 2-50 — 1. See that 1584 = 2*(2- 50 — 1). 
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Example 3. Show that the set N x N is countable. 
Solution: Since N x Z is countable and N x N CN x Z, then N x N is countable. 


Theorem 19.4 Any finite product, N x N x --- x N, of N is a countable set. 
Proof: 


The proof is by induction. It follows from the statement in example 3. The details are 
left as an exercise. 


19.5 Countable unions of countable sets. 


How many countable sets can we join together and still obtain a countable set? The 
following lemma is the first step towards showing that joining together a countable 
number of countable sets will always result in a countable set. The lemma guarantees 
that the image of a countable set is countable. 


Lemma 19.5 Suppose f maps an infinite countable set A onto a set B = f[A]. Then B is 
countable. 


Proof: 


What we are given: The set A is countable, f : A — B maps A onto the set B. What we 
are required to show: That B is countable. 


Since A is countable we can index the elements of A with the natural numbers. Let 
A = {a;: i € N}. For each b € f[A] = B let ap be the element in f€ ({b}) such that 


b* = min{i E€ N: a; E fF ({b})} 


This minimum element exists since N is well-ordered.! Since {ap : b € B} C A, then 
{ap : b € B} is countable. (Subsets of countable sets are countable.) Then the function 
g : {a : b € B} — B defined as g(a») = b is one-to-one and onto B. So B is countable 
as required. 


Theorem 19.6 Let {A; : 71 € S C N} be a countable set of non-empty countable sets Aj. 
Then (;es Ai is countable. 


1Note that the axiom of choice is not required for this. 
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Proof: 


What we are given: That the sets in {A; : i € S C N} are all countable sets. 
What we are required to show: That U;<¢ Ai is countable. 


Since each set A; is countable, then we can index the elements of each A; with an initial 
segment T; of natural numbers or with all elements of N. For each i, let Aj = {ag j) : j € 
T; C N}. 

We will define a function f : Ujeg Ai > S x N as follows: f(aq;)) = (i, j). 

We see that f maps Ujes-A; one-to-one into S x N. Since f [UicsA4] C Sx NCNXN, it 
is countable. Then UjesA; is the one-to-one image of the countable set f [UicsA;] under 
the inverse map f~t. 


We conclude that UiesA; is countable. 





19.6 The set R of all real numbers is uncountable. 


It is only after numerous attempts to show that the set of all real numbers is countably 
infinite, that mathematicians turned their attention towards showing that R is not 
a countable set. We can of course not say that since our very best mathematicians 
are unable to prove that R is countable, then R must be uncountable and leave it 
at that. We present a clever proof devised by Georg Cantor (1845-1918). Cantor 
successfully shows that no one-to-one image of N in R can be comprised of all real 
numbers. That is, for every one-to-one function f : N — R, the subset, R — f[N], of 
R will never be empty. This way of proving that R is uncountable is referred to as 
Cantor’s diagonalization method. 


Theorem 19.7 The set R of all real numbers is uncountable. 


Proof: 


We will show that the open interval (0, 1) is not countable. As a consequence, it will be 
impossible for R to be countable, since subsets of countable sets have been shown to be 
countable. 

Proof by contradiction. 

Suppose f : N — (0,1) is a one-to-one function mapping N onto (0,1). This means 
that we can index the elements of (0,1) with the natural numbers as follows: (0,1) = 


{x0, £1, £2, %3,...,} where x; = f(i). We claim that at least one real number does not 
belong to f[N] and so f is not “onto” (0,1): 
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— We write out each real number as an infinite decimal expansion: 


Lo = 0.11412413014015... 

zı = 0.a21422023024095... 

z2 = 0.431432033044045..- 
In] = 0.An1An2AnZAn4sAns -- - 


We will construct a non-zero real number 0.b,b9b3b4... between 0 and 1 which is not 
accounted for in this list. 


- For each å, if ay € {0,1,2,3,4} let b; = 7. If ay € {5,6, 7, 8,9} let b; = 2. Note 
that there is nothing special about the numbers 7 and 2. We could have chosen 
another pair of integers between 0 and 9. But not 9’s since a number containing 
an infinite string of 9’s can produce a real number which has two infinite decimal 
representations. 

- So the real number x = 0.6;b2b3b4... is a string of 2’s and 7’s. We claim that 
x is not in the set {xo, £1, £2, 73,..., } said to contain all real numbers in (0, 1). 
Verify that |b; — a1ı| > 1, |b2 — a22| > 1, and more generally, |b; — a;;| > 1 for all 
i and so the real number x cannot be any one the numbers in the list as claimed. 


— So the function f is not onto (0,1) as claimed. 


Note that this does not only prove that this particular function is not onto; it also proves 
that all one-to-one functions f : N — (0,1) which claim to be onto cannot be so. Since 
(0, 1) is not countable, then R cannot be countable. 


Maybe for philosophical reasons, some readers may find it difficult to accept that 
the infinite set R is not a one-to-one image of N even though they can point to no 
obvious errors in Cantor’s proof. Skeptical readers may find some comfort in learn- 
ing that even very skilled mathematicians, when confronted by results which appear 
counter-intuitive, may harbour some nagging doubts in spite of being presented with 
an irrefutable proof. Georg Cantor once wrote to Richard Dedekind “Je le vois, mais 
je ne le crois pas”? (I see it, but I don’t believe it) after determining that there is a 
one-to-one correspondence between all points in the plane and the set of points on a 
line. 


Tt can be shown, for example, that the rational numbers 0.04999999... and 0.05000... are different 
representations of the same rational number 5/100. But the decimal representation of a rational a/b is 
unique provided we do not allow a tail end of 9’s in our representation of this number. 

2Jean Cavaillès, Philosophie mathématique, p. 211. 
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Having now convinced ourselves that the set of all real numbers is uncountable, we can 
subdivide the class of all infinite sets into two categories: the subclass of all countably 
infinite sets and the subclass of all uncountably infinite sets. We will see in the next 
section that the class of all uncountable sets can itself be divided into other major 
subcategories of infinite sets. 


Concepts review: 


1. 


2. 


What does it mean to say that two sets are equipotent sets? 


What does it mean to say that a set is countable? 


. Is the set @ countable? 


. What can we say about the image of a countable set under some function f? 


. Which of the sets N, Q, Z, N x Z, R are countable sets. 


. Is it true that the subset of a countable set must be countable? 


. How is the procedure Cantor used to prove the uncountability of R referred to? 


. What can we say about the countable union of countable sets? 


EXERCISES 


Or 


Oo OND 


. Prove that N x N is an infinite countable subset. 
. Prove that the sets N — {0} and N are equipotent. 
. Are there sets which are neither countable nor uncountable? Construct a set other 


than R which is uncountable and explain what makes this set uncountable. 


. Prove that Q does not contain a one-to-one image of R. 
. If S is finite show that A(S) is countable. 


. Prove that N contains a subset which is equipotent with Q x Q. 

. Show that (N x N) x N is countable. 

. Show that N x R is uncountable. 

. Suppose f : R — Z is defined as follows: f(x) is the smallest integer y such that 


y > «x. Is the quotient set induced on R by f a countable or uncountable set. Explain. 
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10. 


11. 
12. 
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Suppose we tried to use the Cantor diagonalization method to prove that Q is un- 
countable. Explain why this would not work. 


Is the set Y(Q) countable or uncountable. Explain why. 
Prove that the one-to-one image of an uncountable set must be uncountable. 
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20 / Equipotence as an equivalence relation. 


20.1 


Summary. In this section we first show that the equipotence relation “we” is an 
equivalence relation on the class Z of all sets. We use this equivalence relation 
to partition Z into equivalence classes. We show that for any set S, S cannot 
be equipotent to its power set P(S). This fact allows us to construct infinitely 
many distinct classes of mutually equipotent sets. We also show that for any 
non-empty set S, the two sets P(S) and 2° are equipotent. Finally, we show 
that P(N) is embedded in R and R is embedded in P(N). 


Viewing equipotence as a relation. 


Let Z ={S:S is a set} denote the class of all sets. The Axiom of class construction 
guarantees this to be a well-defined class. Now “equipotence” can be viewed as a 
relation Re on Z: a pair (A, B) E€ Y x F belongs to Re if and only if A and B are 
equipotent. So, the word equipotence conveniently refers to the property possessed by 
sets which are equipotent.! 


Notation: If two sets, A and B, are equipotent we will represent this property by, 
A ~e B. 


For example, we have previously shown that N ~e Q and N ~e N x Z; so the pairs 
(N, Q) and (N,N x Z) belong to the relation Re on Z. We can also say, for example, 
that the class 

{LSE ZF: Swe Nor Swen, nEeN} 


is the class of all countable sets. Also, for all infinite subsets A of N, N ~e A. We 
have also seen that N £e R; hence, (N, R) ¢ Re. It is natural to wonder whether Re 
is an equivalence relation on .Y%. We immediately verify that this is the case. 


Theorem 20.1 Let Z be a class of sets. The equipotence relation Re on Z is an equiva- 
lence relation on J. 


Proof: 


Reflexivity: For every set S, S is equipotent to itself. 
Symmetry: If S is equipotent to T, then T is equipotent to S. 
Transitivity: If S is equipotent to T and T is equipotent to H, then S is equipotent to H. 


1The word “equinumerous” is also used to describe two sets which are equipotent. The word “equinu- 
merosity” is also used to describe the property of sets which are equipotent. The word “equipotence” has the 
advantage of having only four syllables rather than the tongue-twisting seven syllables in “equinumerosity” . 
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20,2 
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The equivalence relation, Re, on the class Y allows the construction of subclasses of 
Z we call equivalence classes induced by Re. If S is a set we will represent the equiv- 
alence class which contains S by [S]-. We list infinitely many distinct equivalence 
classes: 


Just about every set we have discussed up to now belongs to one of these equivalence 
classes. Of course, this affirmation must be verified case by case. This gives rise to 
a compelling question: Are there any equipotence-induced equivalence classes other 
than the ones listed here? To show that there are, is one of the main objectives of 
this section. 


A few fundamental properties of the equipotence relation. 


There are far too many sets to determine, on a case by case basis, which pairs belong 
to the same equipotence-induced equivalence class and which don’t. There are, how- 
ever, a few fundamental equipotence relation properties we can derive immediately 
which will help us classify sets by equipotence. We prove some of these now. 


Theorem 20.2 Suppose A, B, C and D are sets such that A ~e B and C ~e D where 
ANC = Ø= BAD. Then (AU C) ~e (BUD). 


Proof: 


What we are given: A ~e B and C ~e D, ANC = Ø, BAD =Ø 
What we are required to show: (AU C) ~e (BUD) 


Since A ~e B and C ~e D, there exists one-to-one onto functions f : A — B and 
g : C — D. Define the function h : AU C —> BUD as follows: h|4 = f and hic = g. 
Since AN C = Ø = BA D, then h is well-defined, one-to-one and onto BU D. So 
(AUC) ~e (BU D) as required. 


Theorem 20.3 Suppose A, B, C and D are sets such that A ~e B and C ~e D. Then 
Ax C ~e Bx D. 


Proof: 
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What we are given: A ~e B and C ~e D 
What we are required to show: That A x C ~e B x D. 


Since A ~e B and C ~e D, there exists one-to-one onto functions f : A — B and 
g : C — D. Define the function h : A x C —> B x D as follows: h(a, c) = (f(a), g(c)). It 


is left as an exercise to show that h is a one-to-one onto function. So A x C ~e B x Das 
required. 


Corollary 20.4 Suppose A and B are infinite sets. 

1) If {A, B} c [Nẹe, then A x Be [N]-. 

2) If {A, B} c [R]e, then A x B € [R].. Hence, Rx R ~e R. 
Proof: 


1) Suppose {A, B} c [N]e. Then A ~e N and B ~e N. By the theorem 20.3, A x B ~e 
N x N where N x N is known to be countable (theorem 19.4). Hence, A x B € [N]e. 


2) Suppose {A, B} c [R]e. Then A ~e R and B ~e R and so, by the theorem 20.3, 
Ax B ~e RxR. It is easily seen that R is equipotent with {1} x R C R x R. Since 
{1} x R is uncountable, R x R is uncountable. Then A x B ¢ [N]-. We must now show 
that A x B € [R]e. This will be the case if R x R € [R]e. 

















We claim that R x R € [R] 


e 


— It is easily seen that (0,1) ~e R.1 Then, by the theorem 20.3, (0,1) x (0,1) ~e R x R. 
To show that R x R € [R]e it then suffices to show that (0,1) x (0,1) ~e (0,1). 


— For 0.4, 29732425... E€ (0,1) (ignoring those decimal expansions with infinite strings 
of 9’s) define the function f : (0,1) — (0,1) x (0,1) as follows: 


f(0.£1£2£3£425 . . .) = (0.£1£3£5L729 . . . , 0.L2L4L6L4T8 -..) 


The function f is onto: 


- Let (0.a1a2a3a44a5 S 0.b1b2b3b4bs5 . . .) € (0, 1) x (0, 1). Then 
f (0.a1b1a2b2a3b3 ste .) = (0.a,a2a34445 ee; 0.b1b2b3b4b5 T :) 


so f is onto (0,1) x (0,1). 
The function f is one-to-one: 


Tx 


'The function f(x) = tan (72) maps the open interval (0, 1) one-to-one onto (0,00). If g(x) = Ina, gof 
maps (0,1) one-to-one onto (—oo, co) = R. 
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- Let (0.a1a2a304 ...,0.b0;b2b3b4 . . 3) E€ (0, 1) x (0, 1). By definition of f, 
(0.a1b1a2b2a3b3 . . .) is the only element that can be mapped to 
(0.a1a2a344 . . . , 0.b1b2b3b4 . . .). Hence, f is one-to-one. 


So (0,1) ~e (0,1) x (0,1) as required. 


We have shown that not only is R x R uncountable, R x R € [R]e. Since Ax B ~e RxR, 
Ax Be [Rle. 


Example: Show that (—7/2,7/2)x Q ~e R x N. 
Solution: 


Since the function f(x) = tana maps the interval (—7/2,7/2) one-to-one and onto 
R while Q has been shown to be equipotent with N, then, by the above theorem, 
(—7/2,7/2)x Q ~e R xN. 


20.3 Products of countable sets. 


Given two infinite countable sets S and T, using theorem 20.3, we can write 
S xT we Nx Nye N 


So the product of two infinite countable sets is countable. If {A; : i = 0, 1,2,3,..., n} 
is a finite set of countable sets then the expression Į [;—o A; is defined as: 


[[ 4: = 40 x 41 x Ao X tet X Ån = { (20, T1, £2, .--, En) : £; € Ai} 
i=0 


We can prove the following slightly more general result. 


Theorem 20.5 Let {A; : i € N} be a (countable) set of non-empty countable sets. Then 
[Tio Ai is countable for all n. 


Proof: 


What we are given: {A;:7 € N} is a set of countable sets. 

What we are required to show: Ap x A; X Ag x --- X Ay is countable. 

We know that the product of any two non-empty countable sets is countable: 
We prove the statement by induction. Let P(n) be the statement 


n 
“TI A; is countable ” 
i=0 
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— Base case: Since Ag is countable, then P(0) holds true. 
— Inductive hypothesis: Suppose P(n) holds true. Then [[;—o A; is countable. Now 


n+1 


II Aj ~e 1 a) x Án+1 
i=0 i=0 


is a product of two countable sets. We have initially shown that products of pairs of 
countable sets is countable. Hence, the product Bua A; is countable. So P(n + 1) 
holds true. 


By the principle of mathematical induction Į [;—o A; is countable for all n. 


The reader should be careful not to generalize the above theorem when it comes to 
Cartesian products. It does not say that “The Cartesian product of countably many 
countable sets is countable”. This statement does not hold true in general. We will 
soon witness infinite products of countable sets which are not countable. 


20.4 Adding countable sets to infinite sets. 


We will now show that if a set, S, is infinite and another set, T, is countable (finite 
or infinite), then S UT € [S]e. Adding countably many elements to an infinite set S 
will always result in a set which is equipotent with S. 


Theorem 20.6 Suppose S is an infinite set and T is a countable set such that SN T = Ø. 
Then 

S ~ve SUT 
Proof: 


What we are given: That T is countable, S is infinite and SNA T = Ø. 
What we are required to show: That S and S UT are equipotent. 
It suffices to construct a one-to-one function f : SUT — S mapping SUT onto S. 
— Since S is infinite, then it must contain a countably infinite subset, say, X. (By theorem 18.9.) 
— Since T is countable and X is countably infinite, then, by theorem 19.6, TU X is 
countably infinite. 
— So there exists a one-to-one onto function g : TUX — X mapping TUX onto X C S 
(since both TU X and X belong to [N]e.) 
— We will now construct a function f : SUT — S as follows: 
{ x if xEeS—-xX 


f(a) = giz) if wxETUX 


We see that f maps S U T one-to-one and onto S, as required. 
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Example: Let J denote the set of all irrational numbers. Show that J € [R]e. 


It was shown that the set of all rational numbers Q is countably infinite. If J was 
countable, then by the theorem above, R = JUQ would be countable, a contradiction. 
So J is uncountable. We claim that J € [R]e. 


— Since JN Q = Ø, then, by theorem 20.6, J ~ve JU Q = R. We conclude that 
J € [Rle. 


Equipotence classes of power sets. 


Given any set S, the Axiom of power set allows us to construct a new set, A(S'), by 
gathering together all subsets of S and viewing these subsets as the elements of Y(S). 
We have also shown that if a set S contains n elements, then its power set, Y(S), 
contains precisely 2” elements. Obviously, at least for finite sets S, S and A(S) are 
not equipotent. We would like to see whether this rule generalizes to infinite sets. 


We begin by showing that pairs of sets which are equipotent produce equipotent power 
sets. 


Theorem 20.7 If the sets A and B are equipotent, then so are their associated power sets 


P(A) and P(B). 


Proof: 


Given that A and B are equipotent, there exists a one-to-one function f : A — B mapping 
A onto B. We define the function f* : P(A) — P(B) as follows: 


f(T) =M $ fT] =M 


Claim: The function f* is onto A(B). 
Proof of claim: Let M € P(B). If M = Ø, then f*(@) = f[o] = Ø. Suppose M is 
a non-empty subset of B. Since f is onto B, M C f[A]. Then f[f~[M]] = M. So f* 
maps the element f€ [M] in F(A) to the element M in A(B). We conclude that f* 
maps Y(A) onto Y(B). 

Claim: The function f* is well-defined. 


Suppose U and V are distinct elements of Y(B) such that x € U — V # Ø. Since f 
is one-to-one and onto, f* [{x}] is a singleton set {y} contained in f~[U] C A. Since 
c€¢V,y¢f-[V]. Then y € f€ [U] — f“ [V]; this implies that f* maps the distinct 
elements fT [U] and f€ |V] to U and V respectively. 


Claim: The function f* is one-to-one. 
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Suppose S and T are distinct elements of A(A) where S is non-empty. Suppose that 
xz Ee S-—T# Ø. Since f is one-to-one, f(x) g f[T]. Then f(x) € f[S] — f[T]; this 
implies f[S] 4 f[T]. We conclude that f*(S) 4 f*(T). So f* is one-to-one. 

So P(A) and A(B) are equipotent. 


SS =. 


At this point, we know of only two equipotence-induced equivalence classes of infinite 
sets. They are [N]e and [R]e. We wonder: Is A(N) a countable set? That is, does 
P(N) belong to [N]e? Let’s gather together a few proven facts and possible deductions 
which can be made from these: 


— We have already shown that that Q ~e N. 
— It then follows from the theorem above that F(Q) ~e P(N). 


— Now R was defined as the set of all Dedekind cuts; Dedekind cuts were seen to 
be elements of A(Q). 


— Hence, R is equipotent to a subset of Y(Q). 

— Since R is uncountable, then A(Q) must also be uncountable. 

— It must then follow that A(N) ¢ [N]-. That is, A(N) is uncountable. 
— We would now have to verify whether A(N) € [R]e. 


So, just like A(n) £e n for all natural numbers n, A(N) £e N. 


A general follow-up question might be: Is it possible for any infinite set, S, to be 
equipotent with its power set Y(S)? 


We will show that the answer to this question is, no! That is, if S is infinite, 


S€|P(S)le. 


Theorem 20.8 Any non-empty set S is embedded in its power set Y(S). But no subset 
of S is equipotent with Y(S). 


Proof: 


What we are given: That S is a non-empty set. 
What we are required to prove: 
1) That S is embedded in A(S) 
2) That Y(S) is not equipotent to K for any K C S. 


1) For any element x € S, {x} € P(S), so the function f : S > P(S) defined as f(x) = {x} 
maps S one-to-one into Y(S). So S is embedded in A(S), as required. 
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2) Proof by contradiction. Suppose K C S such that K ~e A(S). Note that K 4 Ø since 
P(S) 4 Ø. Then there exists a function 


g: K > P(S) 


mapping K one-to-one onto P(S). We claim that this leads to a contradiction. 
— If x € K, g(x) is seen to be an element of Z (S) and therefore is a subset of S. 
— Either x € g(x) or x ¢ g(x). Let T be the subset of K defined as: 


T={rek:2x¢g(x)} 


Note that T C K C S and so T € A(S). 
— Since the function g : K > Y(S) is onto Z (S) there must be some element in K, say 
y, such that g(y) =T. Let’s determine whether y is in T or not: 

- Ify E€ T, then, by definition of T, y Z gly) =T. This makes no sense, so y ¢ T. 
On the other hand, if y ¢ T, then y € g(y) = T. This contradicts the pre-established 
fact that y ¢ T. 


The source of this contradiction is the supposition that g : K = Y(S) is one-to-one and 
onto. So K £e P(S) for any KC S. 


The above theorem confirms that for any set S, the equipotence-induced equivalence 
classes, [S]e and [A(S)]-, are always distinct. It also suggests that there are many 
more equipotence-induced equivalence classes than the ones listed previously on page 
190. For example, 








We will show how a never-ending list of equipotence-induced equivalence classes of 
infinite sets can be constructed. We first define the expression “properly embedded”. 


Definition 20.9 We will say that the non-empty set, A, is properly embedded in the set, 
B, if A is equipotent to a proper subset of B but B is not equipotent to A or any of its 
subsets. To represent the relationship “A is properly embedded in B” we will write 


Ae B 


If A and B are sets such that A is equipotent to a subset of B where A may, or may not, 
be equipotent to B, we will say that A is embedded in B. To represent the relationship “A 
is embedded in B” we will write 

A =en B 
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Both relations, œe and ~,.~ are easily seen to be a transitive relations on the class, 
SF, of all sets. 


If F = {S : S isa set}, we define the class € as 
E = {[S]: SE PY 


Both >e and <,.~ induce an order relation on &, which we now define. 


Notation 20.10 Let Y={S:S is a set} and & = {[S]e: SE F }. Let [A], and [B]e be 
elements of &. We write 
[Ale <e [B]e 


if and only if A >e B. We write 
[Ale <e [B]e 


if and only if A >e~ B. 


Some may suspect that <e is a non-strict partial ordering on &. The relation <e 
is easily seen to be reflexive and transitive on &. But it is not clear whether <e is 
antisymmetric on &. We see that [A]e <e [B]e only if A. B and [B]e <e [A]e only 
if Bo.~ A. But does ([A]e <e [B]e) A ([Ble <e [A]e) > ([A]e = [B]-)? Equivalently, 
does “A is embedded in B and B is embedded in A” imply that A and B are equipo- 
tent sets? We suspect that this is the case. But proving this is not a trivial matter. 
So we can not assume this to be a fact at this time. The statement which guarantees 
that this holds true is called the Schréder-Bernstein theorem. This will be the main 
topic of the next section. 


We provide a few examples. We have previously shown that any infinite set contains a 
subset which is equipotent with N, and, since N and R are known to be non-equipotent, 
N >e R; it then follows that [N]. <e [R]-. Similarly, for any natural number n and 
infinite set A, n >e N >e A and so [ne <e [Ale. 


Proposition 20.11 Let S be any set. Suppose 7°(S) = S, A1(S) = P(S) and A"(S) = 
P(P""(S)) for all n > 1. The set 


{1S le (AS Ve (Sle (8 epee IPG) extar 
forms an infinite <,-ordered chain of distinct classes in &. 


Proof: 
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Proof by induction. Let P(n) denote the statement “{[A*(S)].:i=0,1,2,...,n} forms a 
<,-ordered chain of distinct classes in &”.1 

Base case: {[P°(S)]-} = {[S]e} € £. Since {[S]e} contains only one element <e linearly 
orders {[S]e}. The base case holds true. 

Inductive hypothesis: Suppose P(n) holds true. That is, suppose {[ 2t (S)]e : i = 0,1,2,...,n} 
forms a <,-ordered chain of distinct classes in &. We are required to show that P(n + 1) 
holds true. 

By theorem 20.8, A"(S) >e A"*1(S), and so [P"(S)le <e [A"t1(S)]. Since <e is ir- 
reflexive, antisymmetric and transitive on {[A'(S)]. : i = 0,1,2,...,n}, then it must the 
case for {[F"(S)]-:i=0,1,2,...,n,n +1}. So P(n +1) holds true. 

By mathematical induction {[A"(S)].: i = 0,1,2,...,n} forms a <,-ordered chain of dis- 
tinct classes in & for each n € N. 

From this we conclude that the relation <e is a strict linear ordering of the infinite set 


{[A'(S)le:n € N}. 


The above proposition allows us to say that both {[A”"(N)]- : n = 0,1,2,...,} and 
{{A”"(R)]e:n =0,1,2,..., } form infinite <.-ordered chains of distinct classes. It will 
be interesting to determine whether these two chains have any elements in common. 
We will have the tools required to answer this question only in the next section. 


20.6 Equipotence of 2° and A(S). 


In example two on page 129, we introduced the set {1,2} equipped with the lexico- 
graphic linear ordering. This set was defined as being the set of all functions mapping 
N into the set {1,2}. It can be equivalently described as 


{1,2} = {(ao, a1, a2, a3,...,) : a; equals 1 or 2} 


which is the set of all possible countably infinite ordered strings of 1’s and 2’s. Whether 
N is mapped to {1,2} or {0,1} is not considered as being a significantly different set 
since all possible countably infinite strings of 0’s and 1’s will essentially produce a 
set which is equipotent to the set of all possible countably infinite strings of 1’s and 
2’s. Using 0’s and 1’s will allow us to represent the set, {0,1}, more succinctly as, 2. 


We can generalize the expression by replacing N with any set S. That is, if S is any 
non-empty set, 2° represents all functions which map the set S to {0,1}. For example, 
given some finite set, say, S = {3,4} we can actually list the functions in this set as: 


24) = { {(3,0),(4,0)}, {(3,1), (4, D} {3,0), (4, D} {(3, 1), (4,0)} } 


a set equivalent to the set of 4 sequences 


{ {03,04}, {13, la}, {O3, lat, {(13, 04} } 


‘Note that by the Axiom of power set, Y”(S) is a set for all n € N; hence {[A"(S)J-:n EN} C £. 
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This set has four, or 2°, elements. If S has three elements, say, S = {7,8,9} and 
we list all elements of 2° we would see that it contains precisely 2? = 8 elements. 
Verify this. It can be shown my mathematical induction that if S has n elements, 2° 
must contain 2” functions (see the exercise section). Recall that in theorem 18.11, we 
showed by induction that the power set, P(S), of any n-element set, S, contains 2” 
elements. From this fact, we deduce that, 


“If S is finite, 25 ~e A(S)” 


Question: Can we generalize this statement so that it holds true for all sets S, includ- 
ing infinite ones? 


Answer: We will convince ourselves that we can. But this will require some careful 
explaining. To help answer this question let’s consider a third way of viewing the 
elements of 2° (whether S is finite or not). Suppose f € 2°. 


— Then f€ [{1}] and fT [{0}] form disjoint subsets, say T and S—T, of S respectively. 
Note that T may possibly be empty or possibly be all of S. 


— So f is a function which maps z to 1 if and only if x € T and all other elements to 
0. 

— That is, f = yr € 2° = {0,1}%.! In fact, for every K C S, equivalently for every 
K € P(S), xg € 2°; conversely, for every f € 25, there is precisely one T C S, 
equivalently T € P(S), such that f = yr. 


Consider the function, g : A(S) — 2°, defined as: g(T) = yr. We have just shown 
that g maps P(S) one-to-one onto 2°. Hence, 2° ~e A(S). It is worth formally 
stating this important statement as a theorem. 


Theorem 20.12 For any non-empty set S, 


25 = {yp :T € A(S)} ~ve P(S) 


If S = N, then N ~e P(N) or equivalently 2N € [A(N)]e. Similarly, 27 € 
[P(A(N))]_ and 2" € [P(R)]e. 


20.7 Comparing “(N) and R 


Recall that on page 81 we introduced a function called the “characteristic function of T” represented 
as, xr. The function yr on S was defined as a function mapping T to {1} and the rest of S to {0}; it is 
precisely the same function as f. 
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We eagerly wonder: “Do the sets A(N) and R belong to the same equipotence equiv- 
alence class or not?” We know that N is embedded in R. But it is not clear how Y(N) 
relates to R with respect to the equipotence relation. The following theorems do not 
entirely answer this question, but they constitute a first step towards answering it. 


Theorem 20.13 The set, R, is embedded in A(N). 
Proof: 


Recall that Dedekind defines of the real numbers R as being a particular set of initial 
segments of Q and so R C A(Q). Since Q ~e N, then P(Q) ~e A(N) (by theorem 
20.7). Since R C P(Q) ~e P(N), R is equipotent to a subset A(N). 


We provide some background that will help follow the proof of the next statement. 
The theorem 20.12 states that 2N = {yr : T € A(N)} ~e P(N). Recall that the 
function yr : N — {0,1} is defined as 


JO ifngT 
xr(n) = 4 1 ifneT 


So, for a specific subset T of N, yr can be viewed as a sequence, {ig, 21, i2, 73,.--, }, 
of 0’s and 1’s, where ig = 0 only if k ¢ T, otherwise i, = 1. We see that there 
is a one-to-one correspondence between the set {yr : T C N} and the set of se- 
quences { {io, 21, i2, 13,.--, } sin € {0, 1}}. Equivalently, there is a one-to-one cor- 
respondence between the set, {yr : T C N}, and the set of decimal expansions, 
{ ig-tytgi3i4---: where in € {0, 1}. For example, if E = {n € N: n is even }, then 
Xp can uniquely be represented as: 
Xe = {(0,1), (1,0), (2, 1), (3,0), (4, 1), (5,0),-..,} 

_ {lo, 01, lo, 03, la, Os, oft zt 

— 1ọ.0112031405--- 

— 1.01010101--- 


IfO={neEN: nis odd }, then we can write 


{(0, 0), (1, 1), (2, 0), (3, 1), (4,0), (5,1),...,} 
— {00, 11, 02, 13, 04, Scene } 

= 0o-1102130415:-- 

— 0.10101010--- 


Xo 


Say we define a function, f* : {xr : T € A(N)} > { io-irigigia- ++: where in € {0,1}} 
as 
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f (xr) = 19-21 121314 -++ if and only if Lis = xr (n) for n EN 


For example: 


f'(xn) = 1.111111111111111---= 10/9 
f (xs) =  1.010101010101010--- = 100/99 
f (xo) = 0.101010101010101 -- - = 10/99 
f (xe) = 0.000000000000000 - - - = 0 
f (x40,33) = 1.001000000000000 - - - = 1001/1000 


We see that the maximum value in the image of {yr : T € A(N)} under f“ is 10/9 
while the minimum value in the image is 0. Also, if U Æ V, then f*(xu) 4 f (xv) so 
f“ is one-to-one on {xr : T € A(N)}. 


We are now set to prove the following theorem. 


Theorem 20.14 The set Y(N) is embedded in R. 


Proof: 


We define a function f* : {xr :T € A(N)} > { io.irizigi4: ++: where in € {0,1}} as 


f (xr) = 19-21121314 +++ if and only if diy = xr (n) for n EN 
We see that the function f“ maps {xr : T € A(N)} one-to-one into the interval [0, 10/9]. 


It follows that 
P(N) ~e (xr: T E€ A(N)} >e [0, 10/9] CR 


So A(N) is embedded in R, as required. 
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Concepts review: 


1. 


10. 


11. 


Describe the equivalence relation on the class of all sets which was discussed in this 
section. 


. What can we say about the finite union of disjoint countable sets? 
. What can we say about the Cartesian products of two countable sets? 


. If we add a countable set to an infinite set S what can we say about the set that 


results from this union? 


. The set of all irrationals is equipotent with which set? 

. If two sets A and B are equipotent what can we say about their respective power sets? 
. What is the meaning given to the expression “A is properly embedded in B”? 

. From any non-empty set S construct a set B such that S >e B. 


. Name a set which contains a copy of R but is not equipotent with R. 


If S is a set, what set of functions is equipotent with A(S) other than Y(S) itself? 


If S is a set, what does the set {yr : T C S} represent? To what set is it equipotent 
with? 


EXERCISES 


. Show that the following pair of sets are equipotent. 


a) A= (0,1) and B = (—1, 1). (These represent open intervals in R.) 
b) A= (—1, 1) and R. 


. If S = {0, {1,2}} and T = {{x}, y} write out explicitly the elements in the set Z (S) 


and P(T). 


. Show that A(N), A(Q) and A(Z) are equipotent uncountable sets. 
. Prove that the set of all non-negative irrational numbers and the set R of all real 


numbers are equipotent. 


. Show that the sets {0,1} x N is countable. 
. Let S = {a,b,c}. Show that the three sets 2°, P(S) and {yr : T € A(S)} contain 


the same number of elements by listing all their elements. 
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12. 


12. 


13. 


. Is the set NN countable? 

. Is NN embeddable in R? If so, find a suitable mapping. 
. Is R embeddable in NN? If so, find a suitable mapping. 
10. 
11. 


Prove that 2(2°) and P(A(S)) are equipotent. 


Prove, by mathematical induction, that if a non-empty set S contains n elements, 
then the set 2° contains 2” elements. 


Prove that there are infinitely many equipotence-induced equivalence classes of un- 
countable sets. 


Prove that R x R x Rx --- x R (n times) is equipotent to R for all non-zero natural 
numbers n. 


Prove in detail theorem 20.3. 
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21 / The Schr6éder-Bernstein theorem. 


21.1 
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Summary. In this section we state and prove the Schröder-Bernstein theorem. 
We then illustrate some of its consequences. In particular, we use it as a tool 
to prove that R and P(N) are equipotent. We finally show that NN and R are 
equipotent. 


Reviewing some basic properties of infinite sets. 


In the last chapter, we discussed certain properties possessed by infinite sets. We 
will now build on those results to prove more general statements about these. These 
results tend to be less intuitive since they relate to sets other than those which rep- 
resent the numbers we are accustomed to. Before we begin, we list the results from 
the last section which will serve as the main tools to prove the statements which follow. 


Recall that Z = {zx : x is a set}. 


— The countable union of countable sets is countable. 
— The finite product of countable sets is countable. 
— (A~e B) A (C ~ve D) > Ax C ~e Bx D. 

— [(S is infinite) A (T ~e N)| > (S ~e SUT). 

— (SE F) => (S =>. A(S)) 

- {xr:T E€ P(S)} ~e P(S) 

— (SE F) => 25 we P(S) 

— Row A(N) and A(N) >e~ R 


The Schröder-Bernstein theorem. 


Even if two sets are known to be equipotent, it can be quite difficult to construct a 
function which maps one set one-to-one onto the other. For example, we may suspect 
that A(N) and R are equipotent sets, but proving this by actually producing a func- 
tion which maps R one-to-one onto Y(N) could be a challenging task, one that will no 
longer be necessary once we have proved a statement called The Schréder-Bernstein 
theorem. 


Theorem 21.1 (The Schréder-Bernstein theorem) If S and T are infinite subsets where 
S is embedded in T and T is embedded in S, then S$ and T are equipotent. 


The proof is presented once we have proved the following lemma. 
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Lemma 21.2 Let T be a proper subset of the set, S, and f : S — T be a one-to-one 
function mapping S into T. Then there exists a one-to-one function, f° : S — T, mapping 
S onto T. 


Proof: 


What we are given: That T C S; that f : S — T maps S one-to-one into T. 

What we are required to show: There exists a one-to-one function f° : S — T which 
maps S onto T. 

Since T is a proper subset of S, then S — T is non-empty. 

We construct a sequence of sets {.5; : i € N} as follows: 


So = S-T 

Sı = fIS-— T] = f[So] 
S2 = PIS-T]= fiS] 
53 = PIS-T]= fiS 
Sa = f*[S—T]= f[S] 





Sa = FIS- T] = flSn] 


Let U = [ien Si- Since f maps all of S in T, for all i > 0, S; C T. Remember that 
So = S — T. The S;’s can be shown to be pairwise disjoint. Verification of this fact is 
left as an exercise. (This fact is important for the validity of this proof. Try a proof by 
induction.) 


We define the function f* : S — T as follows: 


* f(x) if wEeU 

Po A l if sgU 

— We verify that the image of S under f* is a subset of T: 
Any element a in S is either in U or is not in U. If a € U, then f*(a) = f(a) € Si 
for some i > 1 and so f*(a) € T. Ifa g U, then it is not in So = S — T and so is in 
T. So the image of S under f* is in T. 

— We verify that f* is one-to-one on S: 
Since f“ = fly on U (on which f is one-to-one) and is the identity map on S —U, 
then f* is one-to-one on S. (Some details are left as an exercise. The reader should 
see clearly why this is true.) 


— We verify that f“ is onto T: 
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Let te T. Ift € T—U, then f“ maps t to t. Suppose t € U. Then t € S; for some 
i > 0 (t cannot be in So since So = S—T). Then t is in the image of S;-; under 


f and so is in the image of T under f“. So every element of T is in the image of S 
under f“. 


So f“ maps S one-to-one onto T as required. 


Proof of the Schréder-Bernstein theorem. 
Proof: 


What we are given: There exists a one-to-one function, f : S — T, mapping S into T 
and a one-to-one function, g : T — S, mapping T into S. 
What we are required to show: There is a one-to-one function which maps T onto S. 


Let h = gof. Then A is a function mapping S into S. Since both f and g are one-to-one 
on their respective domains, then A is one-to-one on S. Then 


By the lemma, there exists a one-to-one function h* : S — g[T] mapping S onto g[T| € 
Then h*~' : g[T] > S maps g[T] one-to-one onto S. This means that (h*‘.g)[T] = 
That is, the function hg maps T one-to-one onto S. 

Thus, S and T are equipotent, as required. 


S. 
S. 


21.3 Consequences of the Schröder-Bernstein theorem. 


The important Schréder-Bernstein theorem will allow us to prove statements we sus- 
pected were true but for which we lacked the tools to confirm our suspicions. It shows 
that the relation, <e, on & = {[A].: A E€ F} is antisymmetric since [A]. <e [B]e and 
[B]e <e [A]e implies [A]. = [B]e. Hence, <e is a non-strict ordering of &. 


Theorem 21.3 The set R of all real numbers is equipotent to A(N). 
Proof: 


We proved in theorem 20.13 and 20.14 that R is embedded in A(N) and A(N) is embed- 
ded in R. Then by the Schréder-Bernstein theorem R ~e P(N). 
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21.4 


Since R ~e Y(N), then [R]. = [A(N)]-. By theorem 20.7, A(R) ~e A(A(N)), and 
so [A(R)]. = [A(A(N))]e = [F?(N)]e. It easily follows, by mathematical induction, 
that the two countably infinite <--ordered sets 


{[O]es [es [2le-- +s [Ne Re; [P (Ryle [PR]. [F*(R)le,-- } 
(ole Me Bless Ne [PM] e P Me [P?N)]e Ne] 


have the same elements and so represent identical <e-chains in the class 
& = {[S]e : S isa set}. 





The set NN, 


In theorem 20.12, we saw that for any set S, 
25 we {xr:T CS} ~ve P(S) 


Hence, 2N ~e {yr : T CN} ~e P(N), where 2N is the set of all functions mapping N 
into {0,1}. The set 2‘ was also seen to be equipotent to the set, 
{ {a0, a1, @2,a3,...,} : a; € {0, 1} }.4 


We will now investigate the “set of all functions mapping N into N”. That is, we will 
consider a set whose elements are of the form, f = {(7,a;) : i E€ N,a; E€ N}. To be 
consistent with our notation, we will express this set as 


NN 


For example, g = {(0, 1), (2,4), (3,9), (4,16),...,} represents a particular element 
of the set NN where n is mapped to n?. We could also represent this element as, 


(ao, a1, @2,a3,...,), where a; = i”. That is, each a; is associated to the element 


(4,77)? 

The sets, NN and 2, are both sets of functions with domain, N, except the functions 
in 2N have range, {0,1}, while the functions in NN have range, N. Not surprisingly, if 
g € 2N, then g € NN; hence, 2N c NN. Of course, NA contains many elements which 
do not belong to 2N. For example, {(i, i?) : i € N} belongs to NN but not to 2N. 
But it may still be possible for NN to be equipotent to 2N. If we can show that NN is 
embedded in 2N, it will follow from the Schréder-Bernstein theorem that [NN]. = [2%]. 


Theorem 21.4 The sets, NN and R, are equipotent. 


"If A; = {0, 1} for i = 0,1, 2, 3,... we define [],<y Ai = {(ao, a1, @2,...,) : ai € {0, 1}}. Then [Jien Ai ~e 


2N 


?If A; = N for i = 0,1,2,3,...,, we define [lien 4: = {(a0,a1,a2,a3,...,) : ai E N}. Or if one prefers, 
[lien Ai can be viewed as the set of all possible countably infinite sequences of natural numbers. The element 
g = {(0, 1), (2, 4), (3, 9), (4, 16),..., } can be viewed as (0, 1,4,9, 16, ...,) € [lien Ai = {(a0, 41, 02, 03,...,): 
ai € N}. In fact, Tien Aj we NS. 
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Proof: 


What we are given: NY is the set of all functions mapping N into N. 
What we are required to show: N and R are equipotent. 


Claim: R is embedded in NN. 
— We have shown that R ~e 2N c NN; hence, R is embedded in NN. 
Claim: NN is embedded in R. 
— Let f € NN. Then f can be expressed in the form f = {(0, ao), (1, a1), (2,a2),...} a 
subset of N x N. Since f CN x N, then f € A(N x N). Then 
NN c A(NxN) 
~e P(N) (By 20.4, N x N ~e N, followed by 20.7.) 
~e R 
Then NN is embedded in R as claimed. 
By the Schröder-Bernstein theorem NN ~e P(N) ~e R. 


21.5 The set BA of all functions mapping A into B. 


Since we are discussing sets of functions such as, NN and 2°, we slightly generalize 
these notions by considering ranges other than {0,1} and N. 


Definition 21.5 If A and B are two sets, then the symbol, B4, refers to the set of all 
functions mapping A into B.! 


Examples: 


a) The set Q denotes the set of all functions f : N — Q. For example, 
{zi : xi =1/(i+1), i€ N} = {4,4,5,3} 
is such a function. We can of course say that Q is the set of all infinite countable 
sequences of rational numbers. 
b) If S contains 3 elements and T contains 4 elements we can verify that the set ST 
will contain 34 elements. 


1The following argument confirms that if A and B are sets, then A® is a set: Every element f € AP is 
a subset of the set B x A (finite products of sets are sets). So for every f € AP”, f € P(B x A). Then 
A?” C P(B x A). Since P(B x A) is a set (Axiom of power set), then A? must be a set (Axiom of subset). 
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We wonder how the sets, Q and NN, are related. It is clear that if 

x = {(0, no), (1, n1), (2, n2), (3,n3),..-, } € NN, then x € QN; hence, NN c QN. On 
the other hand, we know that Q and N are equipotent; so there exists a one-to- 
one function f : Q — N mapping Q onto N. It then follows that for any element 
x = {(0,q0), (1, qı), (2, g2), (3, q3),..-,} in Qï, we can associate a unique element 


F(x) Shy {(0, f(4q)), (1, f(a)), (2, f(4q2)), (3, f (43), oa) }) in NN. So QN is embed- 
ded in NN. Hence, by the Schréder-Bernstein theorem, Q and NN are equipotent. 


Concepts review: 


1. 


2: 


What does the Schroder-Bernstein theorem say? 


Name three sets which are equipotent to the power set A(N). 


. What do the symbols 2N and NN mean? 
. Is the set NN equipotent with R? 


. What does the expression B4 mean? If B has 3 elements and A has 2 elements how 


many elements does B“ contain? How many elements does A? contain? 


EXERCISES 


. Show that an infinite set S is countable if and only if S is equipotent with every one 


of its infinite subsets. 


. Prove that an infinite countable set S can be expressed as the union of two disjoint 


infinite countable sets. 


. Prove that if S and T are sets and S — T and T — S are equipotent, then S and T 


are equipotent. 


. Prove that for any m € N, N” is countable. 
. If S = {0,1,2} and T = {z, y} write out explicitly the elements of the following sets: 


. Show that NN is equipotent with a subset of A(N x N). 
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7. Show that A(N x N) is equipotent with a subset of R. 
8. Show that the set {.5; : i € N} of sets constructed in the proof of lemma 21.2 are 


11. 


12. 


13. 


14. 


pairwise disjoint. 


. Show that the function h constructed in the proof of lemma 21.2 is one-to-one on S. 
10. 


Suppose R is an equivalence relation on an infinite countable set S. Show that the 
set of all equivalence classes on S induced by R is countable. 


Let S be the set of all infinite sequences of natural numbers. We will say that a 
sequence s = {s; : i € N} € S has a “constant tail-end” if there exists a number 
k € N such that i > k > si = sp. Let T = {s € S : s has a constant tail-end }. 
Show that T is countable. 

In theorem 20.5 it is proven that if the A;’s are countable, then, for any n, [[;—o Ai 
is countable. Give an example that shows that [],-)j A; need not be countable. 
Explain. 

A subset T of R is said to be open if for every element x € T, x is contained in 
an open interval entirely contained in T. Let Z be a set of pairwise disjoint open 
subsets of R. Show that Z is countable.’ (Hint: Let U = QN (Usgey S). A step 
invoking the Axiom of choice will follow.) 

Show that Q € [R]e. 


'The statement “Any infinite linearly ordered set V such that the set .Y of pairwise disjoint open subsets 
is at most countable must be equipotent with R.” is referred to as the Suslin’s problem. It remained an 
open question until it was proved that it is impossible to prove or disprove this statement from ZF plus the 
Axiom of choice. 
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22 / An introduction to cardinal numbers. 


22.1 


Summary. In this section we state the Continuum hypothesis and the Gener- 
alized continuum hypothesis; we discuss their meaning and consequences. We 
“informally” define the class of cardinal numbers, € , introduce the “aleph” nota- 
tion for cardinal numbers and define addition, multiplication and exponentiation 
of cardinal numbers. Finally, we show that the class of all cardinal numbers is 
a proper class. 


The equipotence-based classification of sets. 


In the last few sections, we have used the equipotence relation to subdivide the class, 
F , of all sets into subclasses, [S]e, of mutually equipotent sets. As was done previously 
(on page 197), we will continue to represent the class of all ~e-equivalence classes on 
S as 

& ={[S]-: S is a set} 


In 20.10, we defined the relation, <e, on & as, 
([A]e <e [B]e) & (A =e B) 


The relation, <e, was seen to be a strict order relation on @. 

We proved two fundamental results concerning the elements of &: 
1) For all sets S, [S]e <e [A(S)]e, proven in theorem 20.8, 
2) [R]e = [Z (N)]e, proven in theorem 21.3. 


Combining these two statements, along with the proposition 20.11, allows us to con- 
clude that the set 


A = {0e e- ie Nle LZ Ne, [Z N)]e, ---, [A'(N)]e,-..,} 


of distinct equipotence-induced equivalence classes, linearly ordered by the relation 
<e, not only contains [R]e but also the equivalence classes of all power sets generated 
by R. That is, [A(N)]e = [R]e, [Z2 (N)]e = [A(R)]-; more generally 


[Pr (Ne = [P"R)]. 


A few words of caution: Even though we have shown that <e linearly orders the set, 
sf, described above, we have not proven that <e linearly orders the class &, even 
though we suspect that it does. We will not assume this to be the case until we 
formally prove it to be true. 
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22.2 The Continuum hypothesis. 


A particularly intriguing question concerning the strictly ordered set of equivalence 
classes described above baffled mathematicians for decades: “Does there exist an 
uncountable set S (that is, one which is not equipotent with N) which is prop- 
erly embedded in R ~e A(N)?” Equivalently, “Does there exist a set S such that 
[NJe <e [S]e <e [Rle = [A(N)]e? ” 


After numerous attempts to construct such a set S in vain, Georg Cantor came to 
believe that no such set S exists. In 1878, he conjectured that: 


“No uncountable set can be properly embedded! in R.” 


This conjecture is referred to as the Continuum hypothesis, abbreviated by CH.? In 
1900, the mathematician David Hilbert declared that proving, or disproving, the Con- 
tinuum hypothesis was one of the 23 most important unsolved mathematical questions 
of that time. In 1940, Kurt Gödel showed it is impossible to disprove the Continuum 
hypothesis within ZFC. In 1963, Paul Cohen showed that it is impossible to prove that 
the Continuum hypothesis holds true within ZFC. This settled the question: Neither 
assuming “CH is true” nor assuming “CH is false” can lead us to a contradiction. This 
means that we are free to work in a universe governed by ZFC+CH or by ZFC+—CH, 
as we prefer, without fear of a contradiction evolving from the annexation of the ax- 
iom CH or -=CH to ZFC.’ 


There is a more general version of the Continuum hypothesis called the Generalized 
Continuum Hypothesis, abbreviated by GCH, which states that: 


“For any infinite set A there does not exist a set S such that 
A >e Soe P(A). 


Or we can equivalently say, “For every set A there does not exist a set S such that 
[A]e <e [S]e <e [A(A)]-”. The GCH only refers to infinite sets, not finite ones. The 
Generalized continuum hypothesis implies the Continuum hypothesis. It is, however, 
known that GCH does not follow from CH. Assuming ZFC+GCH, the linearly ordered 
set 


A = {[0le, e -Ine -a [Ne [A(Ne, (F7(N)e, [A (Ne, ---,} 


for example, is an “initial segment” of equivalence classes in the sense that for any 
n, {[S]e € £ : [S]e <e A”(N)} C &. This does not say that this set represents all 
equipotence-induced equivalence classes, far from it. It simply means that assuming 


Recall that “A is properly embedded in B” means that A is equipotent with a subset of B but B is not 
equipotent with A nor any of its subsets. 

?The word continuum is simply another way of referring to the set R. 

3The negation of CH is represented as “ACH”. The symbol “=” is often read as “not”. 
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22.3 


GCH, such a set forms a string of countably many equivalence classes, with none miss- 
ing. In the ZFC+—GCH universe, for each n > 0, {[S]e € £ : [S]e <e A" (N)} CZ A. 


It is also known that neither GCH nor —GCH can be proved in ZFC.1 


This leaves us with an important question: When trying to determine whether a 
mathematical statement holds true or not, should we assume CH or “not CH”? If a 
statement can be proved without invoking either of these axioms, most readers will 
prefer the proof which avoids these statements (viewing them as being extraneous). 
However, some mathematical statements have as only proof one which assumes CH 
(or GCH). In such cases, the reader should be alerted to this fact, and should be 
informed at which point in the proof this axiom is invoked. 


The class, &, of equipotence induced equivalence classes. 


The reader has no doubt noticed that we have been careful not to refer to the class 
E={(Sle: SEF} 


of all ~,-induced equivalence classes on Z as a “set of sets”. It is not difficult to show 
that & can not be a set of sets. 


Theorem 22.1 The class, & = {[S]-: S E€ Z}, is not a set of sets. 


Proof: 


Proof by contradiction. Suppose & is a set of sets. 


Then, by the Axiom of choice, there exists a choice function, f, which maps every 
element [S]e € € to some element s in [S]e C Z. That is, f chooses from each element 
[S]e of & a set representative, f([S]e) = $ ~e S. 

Let Z = f[E] denote the image of £ under f. By the Axiom of replacement, the image 
of a set under a well-defined function is a set; hence, # is a set of sets. 

Then T = U{s: s € Z} is the union of a set of sets. Hence, by the Axiom of union, T 
is a set. 

By the Axiom of power set, A(T) is also a set. Since A(T) is a set, A(T) € [S]e 
for some [S]e € £. Then A(T) ~e s for some s € Z. But s C T. By transitivity, 
P(T) >e T, a contradiction. 


The source of our contradiction is the assumption that & is a set of sets. 


1 Interestingly, it was shown by Sierpinski that the Axiom of choice follows from ZF-+GCH. That is, the 
Axiom of choice exists in a universe governed by ZF+GCH. 
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22.4 Cardinal numbers. 


We pause to examine more closely the equivalence class [2], the class of all sets which 
are equipotent to the natural number 2. Examples of a few elements which belong to 
[2]. are the sets 


{9, 7} 
{R, N} 


{ 2,{{ ot} } 
{ DLUH } 


each of which is equipotent to 2. We also see that [2]. = [{9, 7}].. Of course, being 
equipotent to itself, 2 = {@,{@}} also belongs to [2]e. In fact, it is the only natural 
number which is an element of [2]e. If we represent the class, [2]e, in this way, rather 
than representing it as, say [{9,7}]e, it is because we surreptitiously selected the set 
2 = {Ø,{Ø}} as being the “official” representative of this class. In fact, we have 
chosen the natural numbers as the official representatives of all equivalence classes 
whose elements are finite sets. On the other hand, possible representatives of [R]e and 
[A(R)]e are R and A(R), respectively. But we could of course have used A(N) and 
P(N), respectively. 


It would be convenient to uniquely specify a class representative for each element of 
&. Determining how we can select a set from each and every equivalence class in & is, 
however, not obvious. The Axiom of choice states that there is a choice function that 
allows us to select an element from each set in a “set of sets”. But & is not a “set of 
sets”. So the Axiom of choice is not available to us as a tool for selecting an element 
in each set in &.! We need to identify a specific property possessed by a single set in 
[S]e which clearly distinguishes it from all other sets in [S]-. Unfortunately, at this 
time, we have not yet sufficiently explored our universe of sets to be able to identify 
what this set property could be. 


For the time being, we will postulate the existence of a class, @, containing the unique 
representatives of each and every single ~,-equivalence class in &. We will refer to 
these class representatives as cardinal numbers.? 


Postulate 22.2 There exists a class of sets @ which satisfies the following properties: 


1. Every natural number n is an element of @. 


1We could use each equivalent class in & as “self-representatives” and call them cardinal numbers. The 
problem with this is that these equivalence class are not known to be sets. We want a set which represents 
each equivalence class in &. 

?For those readers who wish to read ahead, the sets that we will call “cardinal numbers” are the initial 
ordinals. 
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2. Any set S € Z is equipotent to precisely one element in @ 


The sets in @ are called cardinal numbers. When we say that a set, S, has cardinality k, 
we mean that x € @ and that S ~e K, or equivalently, S € [k]e. If the set S has cardinality 
k, we will write, |S| = x. 


22.5 


We emphasize that we postulate the existence of the cardinal numbers, @, imme- 
diately, for convenience only. We will eventually prove that the class @, as defined 
above, exists. 


Definitions and notation associated to cardinal numbers. 


Given any set S, the expression, |S|, was said to denote the cardinality of S. When 
referring to a “generic” uncountably infinite cardinal number (that is, a cardinal num- 
ber which is not the cardinality of an explicitly specified uncountable set S) we will 
represent it by a Greek letter such as & or \.? For example, we would represent a 
cardinal number as a Greek letter in a phrase such as “Suppose S is a set whose 
cardinality is «K, ...”. However, we don’t normally represent a finite cardinal number 
by a Greek letter. When referring to some finite set, F, containing, say, n elements, 
it’s cardinality is the natural number n, so we write, |F| =n. Whenever the natural 
number n is viewed as the representative of all sets which contain n elements, we refer 
to it as a cardinal number. 


The aleph notation. It is customary to express the cardinality of countably infinite 
sets by using the “aleph” notation: We write |N| = No € @. The symbol “No” is 
pronounced “aleph-not”. For example, since we have shown that N x N ~e N and that 
N —e R, we can write |N x N| = No, while |R| 4 No. 


The cardinality of R. The cardinality of the set R is often represented by the sym- 
bol c.! Since R, A(N) and 2N were shown to be equipotent, then we can write 
| A(N)| = |2N| = c. If we assume the Continuum hypothesis (CH), there can be no 
uncountable cardinal number « such that N >e k >e c. That is, CH implies that 
there are no cardinal numbers between No and c. If we assume 4CH, then we are 
saying that there exists an uncountable cardinal « such that N >e k >e C. 


Since we are referring to class representatives as “numbers” we will replace the symbols 
>e with < and >e~ with <. That is, if k = |S| and A = |T] 


K<ASS Oo, T 


KSASS OW T 


?The Greek letter « is read as “kappa”. The Greek letter À is read as “lambda”. 
3The aleph notation was introduced by Georg Cantor. 
1The symbol c abbreviates the word “continuum”, another way of referring to the set R. 
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Transfinite cardinal numbers are those cardinal numbers which are infinite sets. The 
natural numbers are those cardinal numbers which are not transfinite. 


Three operations on cardinal numbers. 


Having explicitly provided symbols, 0, 1, 2, 3, ..., Xo, c, for the first few cardinal num- 
bers, we now develop methods to construct from these, other cardinal numbers not 
found in this list. Methods used to construct new sets from known sets will by applied 
to construct new cardinal numbers from known ones. For example, given two sets, A 
and B, we defined new sets such as C = AUB, D = Ax Band AP. (Recall that AP 
denotes the set of all functions mapping the set B into A.) We will translate these 
three set operations to cardinal number operations. For example, it may be useful to 
know how the cardinality of the sets AU B, A x B and A? compare with the cardi- 
nality of two sets, A and B. Once we formally define operations on cardinal numbers 
we will try to determine if there are general principles that can be used to more ef- 
ficiently compute the value of the cardinal numbers which result from these operations. 


Definition 22.3 If S and T are sets and « = | S| and A = |T|, then we define addition“+” , 
multiplication “x” and exponentiation of two cardinal numbers as follows: 


a) 


b) 


ESAT =, 
K+A=|SUT| 
Bx NS |S xT] 
kò = |S? 





where ST represents the set of all functions mapping T into S (as previously defined). 
That is, |S|!7! = |S7|. For convenience, we define 0* = 0 and k? = 1. 


If two sets, S and T, have non-empty intersection, it is still possible to determine 
|S| + |T| by proceeding as follows: 
(s =|S)) (x = |S x {0}}) 
(A = |T) (A =|T x {1})) 
(SRILA XI) =o > ct rA=—((8 x10} U(T x {1}) 














One should verify that the definitions of sums, products and exponents of cardinal 
numbers agree with the operations we perform with finite cardinal numbers (the nat- 
ural numbers). Suppose, for example, that A contains 4 elements and B contains 
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2 elements. Then there are 16 = 4? elements in AP. Verify this by listing all the 
elements in A?. Also there are 4 x 2 = 8 elements in A x B and 4+ 2 = 6 elements 
in AU B (assuming that A and B have no elements in common). Verify this fact. 


Examples: 
a) What is the cardinality of the set F = {9, {9}, {9@, {a}}}? 
The cardinal numbers of all finite sets are the natural numbers; hence, |F| = 3. 
b) What is the cardinality of the set {2, {3}}? 
The cardinal number of a set does not depend on the type of elements it contains, 
so |{2, {3}}| = 2. 
c) What is the cardinal number of each of the sets Q, Z, and N x N x N? 
They all have the same cardinal number since they are equipotent: 
|Q| = |Z| = |N x N x N| = IN| = No 
d) If A= {1,2,3} and B = {13,14} the cardinalities of A and B are 3 and 2 respec- 
tively. If we view 3 and 2 as cardinal numbers show, by referring to the definition 
of addition, multiplication and exponentiation of cardinal numbers, that 3+2 = 5, 
3 x 2 = 6 and 3? = 9. 
- By definition: 3 + 2 = |A U B| = |{1, 2,3, 13,14}| = 5. 
- By definition: 
3x 2=|Ax B| = KG, 13), (1, 14), (2, 13), (2, 14), (3, 18), (3, 14)}] = 6 
- By definition: 3? = |A®| is equal to the cardinality of the following set of func- 
tions mapping {13, 14} into {1, 2,3}: 
{(13, 1), (14, 1)} 
{(13, 2), (14, 2)} 
{(13, 3), (14, 3)} 
{(13, 1), (14, 2)} 
{(13, 1), (14, 3)} 
{(13, 2), (14, 1)} 
{(13, 2), (14, 3)} 
{(13, 3), (14, 1)} 
{(13, 3), (14, 2)} 
Since there are precisely nine functions in this set, 3? = 9. 
e) What is the cardinality of the two sets 2“ and A(N). 
|25| = |A(N)| = |R| = 
f) Compute 2%, 


2 = AM = |M] = c 
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We will formally show that the class, @, of all cardinal numbers is not a set. The proof 
mimics the one used to show that the class £ is not a set of sets. 


Theorem 22.4 The class @ of all cardinal numbers is a proper class. 


Proof: 


Suppose @ is a set. Then T = U,,-y * must be a set (by the Axiom of union). This implies 
P(T) must be a set (Axiom of power set). Since A(T) is a set, it has a cardinality, say, 
à. Then A(T) ~ve AC T. So A(T) is equipotent to a subset of T, contradicting the 
previously established fact, T œe Y(T). So @ cannot be a set. 


22.7 Previous theorem statements using cardinal number notation. 


Many of the statements proven in the last few sections can now be restated using 
cardinal number notation. The results are from sections 19, 20 and 21. Let S and T 
be sets. Suppose that « = |S], A = |T]. 


If |S| = No andT C S, then either |T| = n for some n € N or |T| = No 
(theorem 19.3) 

If |S] =« = |T| if and only if S ~ve T. (postulate 22.2) 

If Xo < «K, then k +XNo =K. (theorem 20.6) 

If k = |S], then |A(S)|=2*. (theorem 20.12) 

For all cardinal numbers «x, k < 2" (equivalently, k >e 2"). (theorem 20.8) 

|R| =c=2%° (theorem 21.3) 

Continuum hypothesis: There does not exist a cardinal number « such that No < 
k<c=|RI. 

Generalized continuum hypothesis: For any cardinal number « there does not exist 
a cardinal number A such that k < A < 2". 


Concepts review: 


l: 


What does the Continuum hypothesis say? What does the negation of the Continuum 
hypothesis say? Which one holds true in ZFC? 


. State the Generalized continuum hypothesis. 
. Define the class of all cardinal numbers. 
. Describe the finite cardinal numbers. 


. What symbol is used to represent the cardinality of the set R? 
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. What symbol is used to represent the cardinality of N? 
. Which cardinal numbers are referred to as being transfinite cardinal numbers? 


. How are the operations of addition, multiplication and exponentiation of cardinal 


numbers defined? 


. Can the class of all cardinal numbers be referred to as a set? Why? 


EXERCISES 


— 


. Show that there can be no largest cardinal number. 
. Show that for every finite cardinal number n, n € No. 


. Prove that if So, T and T —, M, then S$, M. 


4. Find U{k € @: « isa finite cardinal number}. 


. Determine the cardinal number equal to each of the following expressions: 
a) c+No 
b) No x No 
c) 250 
d) Nå 
. What is the cardinality of the set 22? 
. What is the cardinality of the set NN? 


. Prove that if SCT C M and S ~e M, then S ~e T. 
. Prove that No + c = c + No. 

. Prove that Xo x c = c x No. 

. Prove that No < 250., 

. Show that the class of all infinite sets is not a set. 
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23 / Addition and multiplication in @. 


Summary. We have defined addition and multiplication of cardinal numbers in 
such a way that when adding or multiplying finite cardinal numbers, we obtain 
precisely the same answers as the ones obtained when performing these opera- 
tions with natural numbers in the conventional way. In this section we verify that 
these two operations are well-defined even when adding and multiplying infinite 
cardinals. We then verify that addition and multiplication of cardinals satisfy, 
in many cases, the same properties as addition and multiplication of natural 
numbers. But not all of their properties generalize from the natural numbers to 
infinite cardinal numbers. 


23.1 Reviewing basic facts about @. 


We have seen that every element, «, of the class @ of all cardinal numbers is a set 
around which is gathered in a single class, [k]e, all sets S such that S ~e K. Every 
~--equivalence class in .Y contains exactly one cardinal number. The finite cardinal 
numbers were declared to be the natural numbers. The elements of @ are ordered by 
the relation “<” where « < A if and only if k >e A, and « < A if and only if k GW À. 
We can not yet declare that < linearly orders @ since we have not yet shown that 
>e~ linearly orders Z (although we certainly would like this to be the case). We 
will now study the properties of the two operations, + and x, previously defined on @. 


23.2 Addition of cardinal numbers. 


Addition of two cardinal numbers was defined as follows: “If x = |S| and A = |T] 
where SMT = Ø, then «K + À is equal to the cardinal number |S UT|”. We can easily 
see that if S and T are disjoint finite sets, addition of the cardinal numbers |.$| and |T] 
is simply the number of elements in the set obtained when we merge both sets into one. 


Although we are sure that addition of finite cardinals is well-defined, we should also 
verify that addition of any pair of cardinal numbers is well-defined. 


Theorem 23.1 Addition on @ is well-defined. That is, if S1, S2, Tı and Tọ are sets 
such that s = |S,| = |S2| and A = |T] = [D| and S$; NT, = Ø = S&N To, then 
|S; UT] =K +A = |SU TI. 


Proof: 


What we are given: That S1 N Ti = Ø = S2 N To, Sı and S2 are equipotent and T; and 
To are equipotent. 
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What we are required to show: That |S1UT;| and |S2:U79| are the same cardinal number. 


Since S1, S2 and Ti, To are equipotent pairs, then there exist one-to-one onto functions 


Fa Sie > S2 
gh > h 
By definition of addition, we have 
k+A = |S] +I] = [Sy UT] 
K+A=|So|+|To| = [UT] 


To prove that addition is well-defined, it suffices to show that |S; U Til = |S2 U Tol, i.e., 
that Sı U Tı and Sg U To are equipotent: 
— We define the function h : S1 U Ti > S2 UT» as follows: hls, = f and h| s, = g. 


— Since Sı and S2 are disjoint and both f and g are one-to-one and onto, then A is a 
well-defined one-to-one and onto function. 


— So Sı UT; and S2 U Ts are equipotent. Thus, |S; U Til = |S U T>| as required. 


We now verify that addition on @, thus defined, satisfies most of the basic addition prop- 
erties. 


Theorem 23.2 Let x, 4, 6 and w be any four cardinal numbers. Then 
a) K+A=A+K  (Commutativity of addition) 
b) (K+A) + G=K4 (A+@)  (Associativity of addition) 
c) KIRA 


d)kK<Aandd<YSKtO<AtV. 


Proof 


a) Let S and T be disjoint sets such that « = |S| and A = |T|. To prove that K+ = A+K 
it suffices to prove that SUT ~e TUS. This is left as an exercise. 

b) Let S, T and F be disjoint sets such that « = |S|, A = |T| and ¢=|F|. To prove that 
(«+A)+¢=K+(A+4¢) it suffices to show that (SUT)UF ~e SU(TUF). This is 
left as an exercise. 

c) Let S and T be disjoint sets such that « = |S| and \ = |T|. Since S and T are disjoint, 
we see that S can be mapped one-to-one into the subset S of SUT. Hence, k < K+ .. 
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d) Let {S,F} and {T, P} be to pairs of disjoint sets such that «x = |S| < \ = |T| and 


23.3 


o = |F| < y = |P|. The case where we have equality is straightforward. We will only 
prove the case involving the strict inequality “<”. Assuming k < A and ¢@ < y, 


Soa.T =e TUP S =e TUP 
F=. P => TUP F =e TUP 


Since T and P are disjoint SU F >e TU P. Then k+ < à+. 


On cancelling out terms in addition. Not all addition properties which hold true for 
finite cardinals extend to infinite cardinals. For example, for finite cardinals m, n, k 
the statement 


(m+n=m+k)>n=k 


is always true. But for arbitrary cardinals «K, A, Y, if k+ ÀA = k + y, it does not 
necessarily follow that A = w. Recall (from theorem 20.6) that if 


“If k > No and À < No, then K = Kk + A” 


is shown to be true. For example, c + 0 = c + No does not imply that No is 0. Even 
though, for a non-zero finite cardinal n, the expression n = n +n doesn’t make sense, 
we will soon show that for any infinite cardinal «, it is always true that k = K +K. 


Multiplication of cardinal numbers. 


As previously stated, multiplication of two cardinal numbers is defined as being the 
cardinal number of the Cartesian product of the sets they represent. Just as we have 
done for addition we confirm that multiplication of cardinal numbers is well-defined. 


Theorem 23.3 Multiplication on @ is well-defined. That is, if S1, S2, T) and Tọ are sets 
such that « = |S1| = |S2| and A = |T;| = |T9], then 


|S x Ti] =r x à= |S2 x Tol 


Proof: 


What we are given: That Sı and So are equipotent, 7; and Tọ are equipotent. 
What we are required to show: That |S1 x T| and |S2 x Ty| are the same cardinal number. 


Since S1, S2 and Ti, To are equipotent pairs, then there exist one-to-one onto functions 


f: > Be 
gh > To 


Part VI: Cardinal numbers 225 


By definition of multiplication, we have 
k X A= |S] x Tl = [S1 x Til 
k X A= |S2| x |T>| = |S x Tol 


To prove that multiplication is well-defined it suffices to show that |S; x Ti| =|S2 x Tol, 
i.e., that Sı x Tı and S2 x To are equipotent: 


- We define the function h : S1 x Ti = S2 x To as follows: h(s,t) = (f(s), g(t)). 


- Since both f and g are one-to-one and onto, then h is a well-defined one-to-one and 
onto function.! 


- So Sı x Tı and S2 x To are equipotent. 
So multiplication is well-defined. 
We now describe and prove a few of the most basic multiplication properties on @. 


We will see that most (but not all) of the multiplication properties which hold true 
for the natural numbers extend to infinite cardinal numbers. 


Theorem 23.4 Let x, à, 6 and w be any three cardinal numbers. Then 
a) KX A=XAX K  (Commutativity of multiplication) 

b) (kx A) xX 6=KX (AX Ø) (Associativity of multiplication) 

c) KX (A+) =(K XA) + (KX p) (Left-hand distributivity) 

d) A>0Sk< (Kx 4) 

e) K<Aand d<YWSKXGO<AXY. 

f) K+K=2XK, 


g) K+K< KX when Kk > 2. 


Proof 


a) Let S and T be sets such that « = |S| and A = |T]. 


What we are required to show: That k x X= kK. 
To attain this result it suffices to show that Sx T ~e T x S. 


'To see this, note that (f(s1),9(t1)) = (f(s2), g(t2)) implies that f(si1) = f(s2) and g(t1) = g(t2) which 
implies that sı = s2 and tı = te > (s1,t1) = (s2, t2). 
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Let h: Sx T + T x S be defined as h(s, t) = (t, s). Now 


(s,t)= (a,b) & s=aandt=b 
= (b,a) = (t,s) 


Then A is a one-to-one function. 
Also, if (u,v) E€ T x S, then (v,u) € S x T and h(v, u) = (u,v). So h is onto T x S. 
We conclude that S x T ~e T x S, as required. 


Let S, T and F be sets such that « = |S|, A = |T| and ¢ = |F]. 

What we are required to show: That x x (A x ¢) = (A x k) x @). 

To attain this result it suffices to show that (S x T) x F ~e T x (S x F). This is 
proven in theorem 4.9. 

Let S, T and F be sets such that k = |S|, A = |T| and ¢ = |F|. Without loss of 
generality, suppose T and F are disjoint. 

What we are required to show: That multiplication of cardinal numbers is left-hand 
distributive, i.e., x x (A+ ¢) = (Kx A) + (Kx @). 

By definition, A+ ¢=|TUF|,«x A= |S x T| and k x 6=|S x F|. So 


exOA+¢6) = |SK(TUT)| 
(S x T) U (S x F)| (By theorem 4.7 b) ). 


— |S x T| + |S x F| (Since T and F are disjoint > S x T and S x F are disjoint). 


= (Kx A)+(K x ) 


Let S and T be sets such that « = |S| and A = |T|. Let a € T. 

The property s < (K x A) follows from the fact that the function h : S — S x T defined 
as h(s) = (s,a) maps S one-to-one onto S x {a} C S x T. The details are left as an 
exercise. 

Let S, T, F and P be sets such that « = |S|, A = |T|, 6 = |F| and 4% =|P|. Suppose 
f :S—T maps S one-to-one into T and g : F — P maps F one-to-one into P. That 
is, suppose that k < Aand@d<w 

What we are required to show: That kx @< Ax w. 

To show this, it suffices to show a function h : S x F — T x P which maps S x F 
one-to-one into T x P. 

The function h : S x F — T x P defined as h(s, u) = (f(s), g(u)) can be shown to be 
one-to-one. This is left as an exercise. 

What we are given: That S is a set such that « = |S| and 2 is the cardinal number of 
the set {0, 1}. 

What we are required to show: That k +4 = 2x k. 

Note that « = |S| implies x = |S x {0}| = |S x {1}|. Then, by definition, 


Part VI: Cardinal numbers 227 


KFK (S x {0}) U (S x {1})| (since S x {0} and S x {1} are disjoint). 
|S x ({0} U {1})| (By theorem 4.7 b) ). 
= |S x {0,1} 


F |{0, 1} x S| (since S x {0,1} and {0,1} x S are equipotent). 


2XK 


g) What we are given: That S is a set such that «x = |S| > 2. 
What we are required to show: That K+K <k xX kK. 


By part f), k+ K =2 x K. So it suffices to show that 2x K <KX k. 

Since 2 < k = |S|, then 2 = {0,1} is embedded in S. Let f : {0,1} ~ S bea 
one-to-one function mapping {0,1} into S. Then the function h: {0,1}x S3~ SxS 
defined as h(i, s) = (f(t), s) can be seen as being one-to-one. Showing this is left as 
an exercise. It follows that 2 x k < k X k. SOK+K <k X kK, as required. 


Concepts review: 


1. How do we go about showing that addition and multiplication of cardinal numbers 
are “well-defined” ? 


2. Is addition of cardinal numbers commutative? Is it associative? 
3. Is multiplication of cardinal numbers commutative? Is it associative? 


4. Does A+ K = à +Y imply K = 4? If so why? If not, give an example showing why 
not. 


EXERCISES 


1. Show that |S < |S7| for any set S and non-empty set T. 


2. Let x, A, @ and w be any three cardinal numbers. Show the details of the proofs of 
the following statements: 


a) K+AH=AHK 
b) (KA +A) + ¢=A4+(K+¢) 
c) K<Aand d<YSKtGO<AF4 Vv. 
3. Let x, A, ¢ and w be any three cardinal numbers. Show the details of the proofs of 
the following statements: 
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a) 
b) ey rer = vx (K xo) 
c) A>OSK< (Kx 2X) 
d) K<Aandd<YvSRKxd<rxy. 

e) K+K< 4k XK When k > 2. 
4. Show that 2% = |R — NI. 
5. Show that for any cardinal number Kk, k+K+K+KR=4XKR. 
6. Let n be a finite cardinal number. Prove that: 

a) n + Xo = No. 

b) n x No = Xo. 
c) n+ 2 = 20, 
d) n x 20 = 20, 
e) No +2% = 20, 
f) No x 2% = 20, 


7. Prove that if k x A = 0, then either k = 0 or A = 0. 
8. Prove that if k x A = 1, then either k= 1 or à= 1. 
9. Prove that if k x A = No, then either «k = No or A = No. 


Part VII: Cardinal numbers 


229 


24 / Exponentiation of cardinal numbers. 


24.1 


Summary. In this section we show that cardinal exponentiation is well-defined. 
We then prove three of the most basic properties of cardinal exponentiation 
as well as inequalities involving cardinal exponentiation. We also show that 
[Re = £ = 2, 


Cardinal exponentiation. 


Given two sets, A and B, we have seen that the expression A? represents the set of 
all functions mapping B into A. This means that every element, f € AP, is a subset 
of Bx A. Also, for every pair of ordered pairs in f of the form (x, u) and (x, y), y = u. 
We see that f € P(B x A); hence, A? C P(B x A). If both A and B are finite, 
we more easily understand the use of the notation A? to represent this set. Suppose 
A = {a,b,c}, a three element set, and B = {0,1}, a two element set. We then list the 
functions in the set A?: 


fi = {(0,a), (1,a)} 
f2 = {(0,a),(1,6)} 
fa = {(0,a),(1,¢)} 
fa : {(0,b), (1, 6)} 
fs : {(0,b), (1,a)} 
fe = {(0,b), (1, e)} 
fr : {(0,¢),(1,6)} 
fs : {(0,¢),(1,a)} 
fo = {(0,¢),(1,6)} 


There are precisely nine elements in AP. Or, we can say that the cardinality of |A?| 
of AP is |A|IPI = 3? = 9. So the notation A? is designed to remind us of the number 
of elements contained in such sets when the sets A and B are finite. For convenience, 
this notation is maintained for sets of all cardinalities. In this section we try to develop 
a few rules that will help simplify expressions involving exponentiation of infinite car- 
dinals. We will soon see that cardinal exponentiation is a considerably more complex 
operation than the cardinal addition and multiplication operations. 


We remind ourselves of the formal definition of cardinal number exponentiation: 


If k and A are the cardinal numbers of the non-empty sets A and B we 
define xò = | A|!8! = |A®|. For convenience we define 0* = 0 and K? = 1. 
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We will begin by showing that exponentiation of cardinal numbers is well-defined. 


Theorem 24.1 Exponentiation on @ is well-defined. That is, if S, S*, T and T* are sets 
such that |S| = |S*| and |T| = |T*|, then |S7| = |S*7"|. 





Proof: 
What we are given: The sets S and S* are equipotent as well as the pair T and T*. 
What we are required to prove: That ST and S*?” are equipotent. 
Since S ~e S* and T ~e T* there exist one-to-one onto functions a : T — T* and 


B:S— S*. 


If g € ST define 
plg) = {(a(t), B(g(t))) :t ET} © P(T* x S*) 
We claim that for any g € ST, (g) € S*T™: 


First note that (a(t), G(g(t))) € T* x S* for all t € T and that the domain of 
{(a(t), B(g(t))) : t € T} is T* = alT]. If (ala), B(g(a))) and (a(b), 6(g(b))) are 
elements of ¢(g) such that 3(g(a)) 4 B(g(b)), then g(a) Æ g(b) (since 8 is a one-to- 
one function on S$). Since g € ST, a Æ b. Since a : T — T* is one-to-one, a(a) # a(b). 
We have shown that G(g(a)) Æ G(g(b)) implies that a(a) 4 a(b). So (g) is a func- 
tion whose domain is T* with range S*. Then, for any g € ST, ¢(g) € S** as claimed. 


We claim ¢ : ST — $*"” is a one-to-one function on ST: 


Since ¢ associates to any g € ST an element ¢(g) in S*T*, œ has domain ST and 
range S*T*. Suppose h, g € ST. 





h#g = J3ucET such that h(u)# g(u) 
<> B(g(u)) Æ B(A(w)) (Since £ is one-to-one.) 
& (a(u), B(g(u))) # (alu), B(h(u))) 
= {(a(t), B(g(t))) :t ET} F {(a(t), B(A(t))) :t € TH 
= plg) # olh) 


So ¢: ST — S*T” is one-to-one on ST as claimed. 
Since ST is embedded in 8*7”. Then |ST] < |S*T"]. 
Using the same arguments but replacing a: T — T* with a“! : T* — T and 3: S — S* 
with 6-! : S* — S we can show that S*7” is embedded in ST. Then |$*7"| < |S7]. 


By the Schréder-Bernstein theorem, |S*7”| < |S7| and |S7| < |S*7”| implies that 
|S7| = |S*?"| as required. 
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24.2 Three basic identities involving cardinal exponentiation. 


We now verify that exponentiation in @ satisfies the usual three basic exponential 
properties. 


Theorem 24.2 Let k, A and ¢ be any three cardinal numbers. Then 
a) Kòt? = Kò x KÊ 
b) (Aaa 


c) (kX A)? =k? x AP. 


Proof: 


What we are given: That «, \ and ¢ are cardinal numbers where « = |S|, A = |T| and 
$ = |U] for sets S, T and U. 


a) Kòt? = kò x k? (where it is assumed that T N U = Ø): 
What we are required to show: That |S? x SY] = |STYU] = |S|ITYU1, 
It suffices to show that ST x SU ~e STYY, 
Let (f,g) € ST x SY. Let hiş}: TUU — S be the function defined as: 


_f fie) if ser 
hipaa) = { g(x) if zeU 
We claim that hgs} € STUY: If hep g}(a) Æ hepg}(b), then either f(a) # f(b), 
g(a) # g(b) or f(a) 4 g(b). Since f and g are functions and TOU = Ø, then for one 
of these three cases, a # b. Then h{ pg} € oe 


Define the function ¢: ST x SY — STVU as 


bf, 9) = hifo} 


We claim that ¢ maps ST x SY one-to-one onto STVU; 

— The function ¢ is well-defined: If (f, g) 4 (k,r) in ST x SY, then either f(x) 4 
k(x) for some x € T or g(x) # r(x) for some x € U; hence, hyp g(x) Æ hik r} (2) 
for some z E TUU. So httg} F ley r}- 

— The function ¢ is onto STVU: Suppose t € STVU, Then ¢(t|r,tlu)(£) = 
hittir tlu}(£) for all x € TUU since T and U are disjoint sets and t|r € ST 
and tly € SY. 
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— The function ¢ is one-to-one: Suppose (f,g) and (k,t) are distinct elements of 
ST x SU. We are required to show that ¢(f, g) 4 ¢(k, t). Suppose 


O(F, 9) = hep gy = heey = OK, t) 
Then f = k on T and g = t on U. So (f,g) = (k,t) on TUU. Since TAU = Ø, 
f= kin ST and g = t in SY. Thus, (f, g) = (k, t) in ST x SY, a contradiction. 
Then ¢(f, g) 4 o(k, t) as claimed. 
We conclude that ST x SY and STVU are equipotent. Hence, 


kò x K? — |.s|!7 x | s\lUI = IST x SU _ | STVU] = Foye = Kat? 


b) (Kò)? = K?: 
What we are required to show: That STU and (S7)" are equipotent. 


For each u € U and f € STU we define the function fu : T — S in ST as 
Falt) = firxtuy(t u) E S 
Then for each u € U, fu maps T into S. That is, 


{fa : fE STU, uc U} ST 


Given f € STU define the function f: U —> ST as follows: 
Qf (u) = f u 
We claim that for each f, @f is a well-defined function mapping U into ST: 


slui) Æ flu) > fu F fu 
> flTx{u}(t u1) A f|Tx{u2}(t U2), toran ter. 
=> f(t, u1) A f(t, u2), foraiter. 


= u £u 
So ¢f:U > ST is well-defined, as claimed. 


We define the relation y% : ST*Y — (ST)Y as 
Yl) = of € (S 


We claim that w is a one-to-one function on S7*U: 
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fxg f (to, wo) Æ g(to, uo) for some pair (to, uo) ET x U. 
Ff \ 7x {uo} (to, Uo) Æ 9l-7x{u9} (to, Uo) 

Puc (to) Fal Juv (to) 

Juo F Juo 

Pf # Pg 


YF) AV) 


t~tv¢e¢tve 


STXxU 


So w is a one-to-one function on as claimed. 


We claim that the function 7 is onto (S7)¥: 
Let ¢ be a function in ($7)¥. Then ¢(u) € ST for all u in its domain U. 


We are required to exhibit a function f € S?*Y such that w(f) = ¢. That is, we 
must find a function function f € STU such that f(u) = fu = (u) forall u € U. 


For each u € U, ¢(u) is an element in ST. Define, for each u € U, the function 
gu Mapping T x {u} into S as 


gult, u) = [u] t), Vee T 


Note that T x U = U{T x {u} : u € U}, the union of a collection of pairwise 
disjoint sets. Define f : T x U — S as follows: 


f =Uf{gu: uE U} 


Since the respective domains of the g,,’s are pairwise disjoint and their union is 
all of T x U, then f : T x U — S is well-defined. Then gu = flrx{u} = fu for 
each u € U. So ¢f(u) = fu = O(u) for each u € U. That is, df = W(f). 

Hence, the function 7 is onto (STU, as claimed. 


So the sets ST%U and (S7)" are equipotent. We conclude that (Kò)? = «>*°. 
c) (Kx A)? = KÊ? x A: 
What we are required to show: That SY x TY and (S x T)” are equipotent. 
We define the function ¢: SY x TY = (S x T)" as follows: 


(fg) =h 


where f € SY , g e TY and h: U —S x T is defined as h(u) = (f(u), g(u)). That 
is, O(f,9)(u) = h(u) = (f(u), g(u)) for all u € U. 
We claim that ¢ maps SY x TY one-to-one onto (S x T)": 
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The function ¢ is onto (S x T)”: Suppose h € (S x T). Then h: U > S x T. 
For any u € U, h(u) = (hı (u), ho(u)). Then (h1, h2) = h; so ¢ maps SY x TY 
onto (S x T)Y. 

The function ¢ is one-to-one: Let (fj,g1) and (fo, g2) be pairs of functions in 
SU x TY and q and r be functions mapping U into S x T defined as q(u) = 


(filu), gı(u)) and r(u) = (f2(u), ga(u)). 
Then 


elfi 91) # O(f2,92) © q#r 

= q(u)Ar(u), for some u € U 
= (filu), gi(u)) # (f2(u), g2(u)) 
= filu) # fo(u) or gi(u) A go(u) 
S fiF fzor g #92 

= (fig) # (fe, 92) 

Then the function ¢ is one-to-one. 


We conclude that ¢ maps SY x TY one-to-one onto (S x T)” and so these two sets 
are equipotent. 


The following example shows how these identities can help simplify the computation 
of cardinal exponentials. 


Find the cardinality of R®. 


Solution: 


[R] cE 
= Ne 
go xe 


The function f(x) = (0,2) € {0} x R embeds R in N x R; hence, c < No x c. Since 
No < c, then, by theorem 23.4 e), No x c < c x c. By corollary 20.4, c x c= c. We 
then have 

c<Noxc<cxc=c 


which implies No x c = c. Then 


IR] — 2Noxc = 9¢ 


A few basic inequalities involving cardinal exponentiation. 


We verify a few more basic cardinal exponentiation properties. 
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Theorem 24.3 Let «, A, and a be infinite cardinal numbers. Then 


a) 
b) 


c) 


k < KA 
a < k > aà < kò 


a< A> kl < KA 


Proof: 


What we are given: That « = |K], a = |A| and à = |L]. 


a) 


k < kò: 


What we are required to show: That K is embedded in K4. 


Define the function f : K — K+ as follows: f(k) = {k}} c K*. Note that {k}} 
contains only one function; it maps all elements of L to the single element k. Since 
“k Æ t implies {k}/ 4 {t}””, the function f is one-to-one. Since f embeds K in K+, 
then «K < kô. 


ax<Kk> a < kò: 
What we are also given: That A is embedded in K. 


What we are required to show: That A” is embedded in KY”. 
Suppose the function f : A > K embeds A into K. Define ¢: A” — K x L 


pg) = {0 FG) LE L}CLx K 


We claim that 6(g) € KŁ : 


If (a, f(g(@))) and (b, f(g(b))) are elements of ¢(g) such that f(g(a)) # f(g(b)), then 
g(a) # g(b) (since f is a function mapping A to K). Since (a, g(a)) and (b, g(b)) 
both belong to g € A’, then a Æ b and so ¢(g) is a function in K” as claimed. 


We claim ¢: A’ > K” is one-to-one: 


Suppose h, g € A”. 


hžg @ Ju 
Sef 
S 
= { 
=> 





€ L such that h(u) 4 g(u) 

(u)) # f(h(u)) since f : A — K is one-to-one. 
»F(g(u))) # (u, F(ACu))) 

f(g@))) 1E L} AL, f(AM)) se L} 

)# (h) 


So ¢: AY — K® is one-to-one as claimed. Since 6: AY — K” embeds A” into K+, 
then aò < kò as required. 


(g 


(i, 
blg 
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c) (AC a= NS K? < KA: 
What we are also given: That A is embedded in L. 
What we are required to show: That K^ is embedded in K}. 
Suppose the function f : A L embeds A into L. Define ¢: K4 — f[A] x K as: 


olg) = {(f(a), 9(a)): aE A} C f[A]x KCLXK 


We claim that ¢(g) € KFM: 


If (f(a), g(a)) and (f(b), g(b)) are elements of ¢(g) such that g(a) 4 g(b), then a 4 b 
(since g € K4). Since f is one-to-one mapping A into L, a Æ b implies f(a) Æ f(b). 
Then ¢(g) € KT as claimed. 

We claim ¢: K4 — KTA is one-to-one: 


Suppose h, g € K4. 





h#g = JucE A such that h(u) 4 glu) 
= (F(u), g(u)) # (Fu), h(w)) 
& {(f(a),g(a)):a€ A} # {(f (a), h(a)): a € A} 
> (9) Æ oh) 


So ¢: K4 — KIA] is one-to-one as claimed. 
For each function h € KT the axiom of choice allows us to choose a function h* € K} 
such that h*|¢,4) = h. We define the function ¢* : KA — K+ as ¢*(g) = o(g)*. Since 
6: K4 — K/IAl is one-to-one, then ¢* : K4 — K® is one-to-one. 
We conclude that K^ is embedded in K4. This implies k® < Kò as required. 


Concepts review: 
1. If x and À are two cardinal numbers how is the expression xò defined? 


2. What are the three basic identities for cardinal exponentiation stated and proved in 
this section? 


3. What are the three basic inequalities for cardinal exponentiation stated and proved 
in this section? 


EXERCISES 


A. 1. Show that for any cardinal number «: 


a) kl =k. 
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b) ae= 1: 
c) k? =1 
d) 0€ = 0, if k > 0. 


2. Show that for any finite cardinal number n and any cardinal number x: 
a) (230)” = 230 
b) N = Xo 
c) ito = 20 
d) no = 20, 
e) (250)5o = 250 
3. Let k be an infinite cardinal number. Suppose |K| = « and that {K; : i € K} 
is a set of pairwise disjoint sets K; each of which has cardinality «. Show that 
|U{K;:ic K} =k. 


4. Show that if the cardinal number « 4 1, then Kf Ær. 
5. Show that 250 < 2(2"°), 
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Summary. In this section we examine a few sets whose cardinality is the same 
as the cardinality of R. In particular, we determine the cardinality of the set 
of all one-to-one functions mapping N to R and the cardinality of the set of 
all continuous real-valued functions. We also study the well-known Cantor set, 
discuss its construction and prove that it has cardinality c. 





25.1 Sets related to R. 


It can sometimes be a challenge to determine the cardinality of certain sets. Many of 
the cardinal arithmetic principles presented in the last section will help us determine 
the cardinality of sets associated to R. 


The cardinality of finite products of the reals, of the complex numbers and the irra- 
tional numbers are discussed in the following theorem. 


Theorem 25.1 Let C denote the set of all complex numbers and J denote the set of all 
irrational numbers. Let n denote the cardinality of a non-empty finite set. 


a) The cardinality of R” is c. 

b) The cardinality of C is c. 

c) The cardinality of J is c. 
Proof: 


a) |R"|=c: 
To prove that |R”| = c it suffices to show that c” = c. 
We will prove this by mathematical induction. 


What we are given: That n is a natural number greater than zero. 
What we are required to show: That c” = c. 
Let P(n) be the statement “c = œ”. 
— Base case: Trivially, P(1) holds true. 
— Inductive hypothesis: Suppose P(n) holds true. That is, suppose c” = n. Then 


cot = c” x cl (theorem 24.2 a).) 


= CXC (By the inductive hypothesis.) 


= C (corollary 20.4) 


So by mathematical induction c” = c for all finite non-zero cardinals n. So |R”| = c. 
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b) 


25.2 


|C| = c: 


Define the function f : R? — C as f(a,b) = a + bi. The function f is easily shown to 
be one-to-one. So R? and C are equipotent. It follows that |R?| = |C| = c. 


|J| = c: 

Suppose « = |J|. Since JU Q = R and JN Q = Ø, then x + Xo = c. If k < No, then 
k +No = Xo Æ c. So No < Kk. That is, k is an uncountable set. By theorem, 20.6, 
k +No =k. Hence, k = c. That is, |J| = c as required. 


The cardinality of sets of sequences and functions. 


Sets of sequences and functions are more abstract in nature. This sometimes makes it 
more difficult to determine their cardinality. The following theorem illustrates a few 
strategies that can be used to determine the cardinality of such sets. In the proof of 
the following theorem we will invoke the statement, Nx N ~ N, proven in theorem 19.4. 


Theorem 25.2 


a) 


b) 


c) 


Let r denote the set of all countably infinite sequences of real numbers. Then the 
cardinality of Ap is c. 

Let y denote the set of all countably infinite sequences of natural numbers. Then 
the cardinality of Ay is c. 

Let Ni-1) denote the set of all one-to-one functions mapping N to N. Then the 


cardinality of Nic is c. 


d) Let Rù- denote the set of all one-to-one functions mapping N to R. Then the car- 
dinality of RÌ -1 is c. 
Proof: 


a) 


|r| = c 

A sequence of real numbers {aọ,a1,a2,...} is a function s : N — R mapping each 
natural number 7 € N to a; € R. So each infinite sequence {aọ, a1, a2, . . .} is associated 
to a unique function s : N — R. So the set of all infinite sequences of real numbers 
can be represented by RN. Then 


CES 
= 9XoxXo (By theorem 24.2) 
— 9 IN| = 2X0 (By theorem 19.4) 


Then |Ag| = |RÄ]| = c. 
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b) [A] = c: 
A sequence of natural numbers {aọ, a1, a2, ...} is a function f : N —> N mapping each 
natural number 7 € N to a; E€ N. So the set of all infinite sequences of natural numbers 
can be represented by NN. The cardinality of the set of all infinite sequences of natural 
numbers is then |NN|. Note that 


feNN = fCNxN 
=> fe P(NxN) 


=> NC AINxN) 
Then 
c = Qo 


Np (By theorem 24.3 b).) 

IN“ 

| P(N x N)| 

| API(N)| (N x N ~ N followed by 20.7.) 
= |RiS< 


A Il IA 


We conclude that |N‘| = |A] = c. 
c) INi -nl =e 
Let f € NN. Consider the set 
Sr ={(n, (n, f(n)):n EN} CN x (Nx N) 


We claim that Sp is a one-to-one function mapping N into N x N: 
If (n, (a, b)) and (n, (c, d)) belong to Sp, then a = n = c and, since f is a function, 
b = f(n) = d; hence, (a,b) = (c,d). So Sp is a function mapping N into N x N. If 
(a, (n, f(n))) and (b, (n, f(n)) belong to Sp, then a = b = n. So Sp:N—Nx Nisa 
one-to-one function, as claimed. 

We know that N x N ~e N. Then there exists a function h mapping N x N one-to-one 


onto N. Since both h and Sf are one-to-one, then, for each f € NN, heSs:N — Nis 


a one-to-one function and so belongs to i 


Let H : NN > Nù -1 be a function defined as H(f) = h°Sp. We claim that H is 
one-to-one: 


fAg = f(m) FgQ(M) forsome natural number m 

S¢(m) = (m, f(m)) F (m, g(m)) = Sg(M) for some natural number m 
h(S(m)) Æ h(Sg(M)) for some natural number m 

hoSf £ heSg 

H(f) # H(g) 
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25.3 


So H is one-to-one as claimed. 
Then the function H embeds the set NN into Nia): We conclude that |N‘| < IN}. 
Since INi -nl < |N¥|, then by the Schréder-Bernstein theorem ING»! = |NN| =c. 














N CEI 
Rà- = 
By part ©) Nà- =c. Since Nici C Rii then IRQ»! > c. Also given that 
IRG-pl < IR“ 
= RIN 
= hy 
9XoxXo 
= 2o 
= ¢ 
So IR -nl =c. 


The Cantor set. 


The Cantor setis an inductively constructed subset of the closed interval [0, 1]. Be- 
cause of its interesting properties it is often discussed in various branches of mathe- 
matics. We will present the steps for its construction and discuss its cardinality. 


In what follows, the expression (a,b) will mean the open interval in [0, 1], with end- 
points a and b. We will construct a countably infinite set of subsets {Cn : n € N} 
where each Cn is defined as follows: 


Cı = Co- (1/3, 2/3) 
Cə = Cı- [(1/9, 2/9) U (7/9, 8/9)] 
C3 = Co —[(1/27, 2/27) U (7/27, 8/27) U (13/27, 14/27) U (25/27, 26/27)] 


The construction can be summarized as follows: Ch+1 is obtained by “punching out” 
open middle thirds from each closed subinterval in its predecessor Cn. Actually con- 
structing Co to C3 will allow one to see the pattern of construction and develop a 
mental picture of what each Cn looks like. 
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Pursuing this process an infinite number of times will yield a countably infinite set 
{Cn : n € N} of subsets of the closed interval [0, 1]. For every natural number n, we 
see that Cn+1 C Cn. Furthermore, for each natural number n, Cn will be the union 
of 2” closed intervals. We will index these closed intervals with sequences of n zeroes 
and ones as follows:? 


Co = [0,1] 

Cr = fo Viloy 

C2 = algo, U 2110,1} U af 41,0} U olay 

C3 = 32 40,0,.0; U 3{{0,0,1} U 32 40,1,03 U 32 40,1,13 U 32 41,0,0} U 32 4¢1,0,13 U 32 41,1,0} U 32 gr 113 


C4 = al{o,0,0,0} U 41 40,0,0,13 U 41 f0,0,1,0} U 42 40,0,1,13 U 42 {0,1,0,0} U 41 f0,1,0,13 U 42 40,1,1,0} 
Ual {0,1,1,1} U 42 {1,0,0,0} U 44 {1,0,0,1} U 42 {1,0,1,0} Y 44 {1,0,1,1} U 42 {1,1,0,0} 
Ual¢1.1,0,13 U 4£42,1,1,03 U a2 41,1,1,1} 


For example, 


C2 = (0, 4] U lá, 3] U [§, 3] U (5, 3] 
2110,0} U 2110,1} U 2141,0} U 2141,1} 


N 


1o} U F41} 
[0, 3] U 53] 
= Cj 


More generally, for each n € N, 


Cn = Unk: t € {0,1}"} 


2Some readers may want to refer to the natural numbers expressed in base 2: 0, 1, 10, 11, 100, 101, 110, 
111, ..., to determine the order in which the finite zero-one sequences are ordered. 
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Since C,,41 C Cn for each n € N, the sets {Cn : n € N} are said to be “nested”. We 
call the intersection 
C=(\Cr 


nen 
the Cantor set. 


We begin by listing a few facts about the Cantor set, C, and its construction. 
— Note that if A is a sequence of n zeroes and ones and „T4 is one of the intervals 
which forms Cn, then, by the construction rules of Cn+1, 
ntitLaugo} U nti lau} C nta 


For example, 


31,0,0,.0} U slt0,0,13 C  2Lf0,0} 
320,10} U alroaay C 2Lto14 
311,00} U 3441,0, C 241,0} 
sloio YU slay C 2441,1} 


Then every level Cn = CoN Cy NC2M---MAC), can also be obtained by taking the 
finite union of the intersections of nested sets of closed intervals 


n 


[0, 1] A [0, $] A [0, =] N [0, $] N -+A [0, 340] 


If {,J4 : n € N} is an infinite set of nested closed intervals, then Nnen{nTa} 4 9.4 
At any level n there are 2” closed intervals in C,; 2” is precisely the number of 


different sequences containing n zeroes and ones. 


— The length of each closed interval which forms Ch is J- For example, the set C3 


is the union of closed intervals of the form |Z, net). If {n]a : n € N} is an infinite 
set of nested closed intervals, then the length of the interval N,en{nZa} must be 
limno a = 0. From this we must conclude that the non-empty set QNnen{nla} 
cannot contain any more than a single element z. 

— Notice that every endpoint in Cn will still appear in Cn+1. Once an endpoint ap- 
pears in Cn for some n it is never removed in subsequent steps. For example, since 
4 € Co, then 4 E€ Chn for all n > 2. So if En, denotes the endpoints of the 2” closed 
intervals in Cn, En C Mren Cn. So the countably infinite set UnenEn is contained 
in C. So the Cantor set contains infinitely many points. 


1This statement is referred to as the Nested interval lemma. This lemma is proven in most Calculus 


texts. 
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We will show that the Cantor set C is, quite surprisingly, an uncountably infinite set, 
this in spite of the large amount of points removed from [0, 1] to construct it. There 
are various ways of proving that C is uncountable. We provide a proof that has a 
set-theoretic flavour to it. 


Proposition 25.3 The Cantor set has cardinality c. 


Proof: 


Since the Cantor set C is a subset of [0, 1], then the cardinality of C cannot be larger than 
c. We will show that |{0,1}] < |C| where the cardinality of {0,1}‘ is known to be c (see 
theorem 20.12). 

Let s be a specific sequence in {0,1}. For each n € N, let s(n) denote the finite sequence 
made of the first n terms (of zeroes and ones) of the infinite sequence, s. Then for each 
n EN, ntilenti) C nisn) and so the set {nlsm) : n € N} forms a set of nested closed 
intervals (uniquely determined by the sequence s) obtained by applying the construction 
algorithm of C. The Nested interval lemma guarantees that M729 nJs(n) is non-empty for 
each s € {0,1}‘. For each s, we can then choose an element x, in N° o nl s(n) (To do 
this we invoke the Axiom of choice.) See that, since £s € nIs(n) C Cn for all n, then 
£s E NX _gCy = C. We define the function f : {0,1} s C mapping into C, as f(s) = 2s. 
We claim that f is one-to-one: Suppose s and t are distinct elements of {0,1}N. Let n be 
the least natural number such that s(n) 4 t(n). Then the two closed intervals 


nd s(n) and ntt(n) 


in Cn have empty intersection. Then the intersection of the two sets of nested closed 
intervals 


{f(s)} = {£s} © N{nIs(n) : n E N} and N {niin : n E N} 2 {xe} = {f6} 


must be empty. So x, and x; cannot the same element. This shows that f is one-to-one, 
as claimed. Then f embeds {0,1} into C. Hence, c = |{0,1}%] < |C| as claimed. Since 
C C R, |C| < c. We conclude that |C] = c. 


It is surprising to see that the cardinality of C is the same as the cardinality of R 
since, to obtain C from [0,1] we removed from [0, 1] a total length of open intervals 


equal to 
1 2 4 1 2 2 4 
— + + Eee ae E = 1 


Ba 3 Bi Be 1-2 





There are still uncountably many points that are left behind. One may expect that C’ 
is simply the set of all endpoints that appear in all C;,’s. But this can’t be, since if we 
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take the union of all endpoints U en En we obtain only a countably infinite set, while 
C is uncountable. The Cantor set must then contain uncountably many numbers 
which are not endpoints! Skeptical readers may want to look at the proof again to 
see if there is any sleight of hand. Even if one believes the given proof, it does not 
meant that it will necessarily be what we might call “a satisfying proof”. We cannot 
actually see what is going on at the very high levels of n. The proof doesn’t help 
us understand why the “non-endpoints” in C are not excluded in the construction 
process. 


Identifying numbers in C which are non-endpoints. The following arguments show 
why some “non-endpoints” of C remain in the infinite intersection of the sets, Ch, 
which are used to construct the Cantor set C. Consider the sequence of numbers, 


{Sn : n € N}, where 
n —1 k 
os = 
¥ (3) 


k=0 
By carefully examining this sequence we can deduce the following facts: 


— We see that So = 1, S1 = 1-4, S2 =1-4 +5, $3 =1-4 +4- 5, and so on. 


— The subsequence {Sən : n € N} is a strictly decreasing sequence while the sub- 
sequence {52,41 : Nn € N} is a strictly increasing sequence. 


— We also see that, for all n € N, [Son41, San] C [S2n-1, S2n—2], hence the set 
{[Son41, San] : n E N} forms a nested set of closed intervals. 


— For each n, [Sn41, Son] C Con41- 
— Since the elements of the sequence {Sn} are partial sums of a geometric series 
I-T 
sequence {Sn} is then computed to be 3 where, for all n, Soni < 3 < Son. 
— We deduce that 


n ifa n+l 
with common ratio r = => then Sn = a = Loe The limit of the 


{3/4} = A{[Son41, San] :n EN} CA{C,:n EN} HC 
So, even though 3/4 is not an endpoint of one of the subintervals which form each 


level Cn it belongs to the Cantor set. Other such points can be found in C in this way. 


Counting those elements of A(R) of cardinality c. 
In what follows we will let 2 denote the set 
Me ={S CR: |S| = |R| =c} 


Since {R — {x}: x E€ R} C Rand m C A(R), then c < |A| < 2°. We know that 
the cardinality of A(R) is 2'*! = 2°. We claim that |2| = 2°. 
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Recall (from corollary 20.4) that |R x R| = |R| = c. Then to each subset S C R x R 
of cardinality c we can associate a unique subset S* C R of cardinality c. Then 


|2| =S E RxR: |S|= c} 
For any non-empty subset U of R and z € U, 
c= |{x} x R| < |U x R| = |UxR|>cYUCSCR 


Since 


HU xR:U € A(R) — {Ø} = 2° 
then 


|| =HSCRxR:|S| =c} > HU xR:U € A(R) - {o}}| =2° 


So |&| = 2°, as required. 


Counting all real-valued continuous functions on R. 


Continuous real-valued functions on R can be characterized as being those functions 
which satisfy the property 

for any sequence {£n} of numbers which converges to a number x. Since every irra- 
tional number is the limit of a sequence of rational numbers, the value of a function f 
at an irrational number x is uniquely determined by the value of this function at all 
rational numbers which surround it. This means that given any continuous real-valued 
function, f, on Q, the continuous function f* on R such that f*|g = f is unique. In 
set-theoretic language, this means that the sets, Z = {f € RB : f is continuous} and 
Q={f © R: fis continuous}, are equipotent. So to determine the cardinality of 
the set Z it suffices to determine the cardinality of the set 2. It is shown in theorem 
25.2 part a) that the cardinality of the set RN of all functions mapping N into R is 
c. From the equipotence relation Q ~e N we deduce that R ~e RN. Since 2 c RY, 
|Z| < c. The uncountable set {f € R2: f = c,c € R} of constant functions is a 
subset of the set 2. Then c = |{f € RL : f = cc € R}| < |2|. So the cardinality 
of 2 is c. We conclude that the cardinality of the set of all real-valued continuous 
functions on R is c. 








Concepts review: 


1. What is the cardinality of R” for any natural number n? 
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2. Do the real numbers and the complex numbers have the same cardinality? 


3. How does the cardinality of the set of all countably infinite sequences of real numbers 
compare with the cardinality of RE? 


4. What is the cardinality of the set of all irrational numbers? 


5. What is the cardinality of the set of all countably infinite sequences of natural num- 
bers? 


6. What is the cardinality of the set of all countably infinite sequences of real numbers? 


7. Let NN denote the set of all functions mapping N into N and NÌ } denote the set of 
one-to-one functions mapping N into N. Are the sets NN and Ney equipotent? What 
is their cardinality? 


8. What is the Cantor set? How is it constructed? What is its cardinality? 


9. What is the cardinality of the set of all continuous real-valued functions? 


EXERCISES 


1. Show that for any finite cardinal n, n x 9 (20) — 9(2%0) 


2. Let A(N)r denote the set of all finite subsets of N. 
a) Show that the set N is embedded in A(N)r. 
b) Express A(N)r as the union of a countably infinite number of pairwise disjoint 
subsets of N. 
c) What is the cardinality of the set UnenN*? 
d) Construct a one-to-one function which embeds A(N)r in UpenN*. 
e) What is the cardinality of A(N)r? 
3. Let A(R)r denote the set of all finite subsets of R. 
a) Show that the set R is embedded in A(R)r. 
b) Express A(R)r as the union of a countably infinite number of pairwise disjoint 
subsets of R. 
c) What is the cardinality of the set UpenR*? 
d) Construct a one-to-one function which embeds A(R)r in UperR*. 
e) What is the cardinality of A(R) Fr? 


4. Consider the strictly increasing sequence S = {2!, 27, 23,24,...,2”,...} of cardinal 
numbers. 
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a) Does the cardinal number 2% belong to S? Why? 


b) Consider the two infinite cardinal numbers Nọ and 2*°. Is one of these a least 
upper bound of $?! If not say why. If so which one? 


1We remind the reader of the definition of “a least upper bound of an ordered set S”: The element m is 
a least upper bound of a set S if m is an upper bound of S and for any other upper bound n, m < n. 
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26 / Well-ordered sets. 


26.1 


Summary. In this section we review a few basic notions about well-ordered 
sets. We define special subsets of well-ordered sets called “initial segments”. 
“Order isomorphisms” are defined as strictly increasing one-to-one functions 
between well-ordered sets. Initial segments of well-ordered sets are themselves 
well-ordered sets. We prove basic properties of order isomorphisms between ini- 
tial segments and well-ordered sets. We then define the relation, < wo, on well- 
ordered sets as follows: “S <wo T if S and T are order isomorphic or one is 
order isomorphic to an initial segment of the other”. This section provides the 
fundamental background for the study of the important set-theoretic topic of “or- 
dinal numbers”. 


Overview. 


In the last few sections, we have familiarized ourselves with some of the main prop- 
erties of infinite sets. We have seen that the ZFC-axioms have cleared a path into 
unfamiliar mathematical territory, populated by uncountably many “infinite sets” in 
infinite varieties, leading us to reflect on numerous counterintuitive notions. We have 
discovered, for example, that given any infinite set A we can find another infinite set 
B = P(A), not equipotent to A, which properly contains a one-to-one copy of A. 
To express this relationship we said that A is “properly embedded” in B and wrote 
Ae B. We can thus construct infinite chains of sets linearly ordered by the proper 
embedding <,-relation. For example: 


OG, 1 oe 2 Oe e N >e ZPN) — A(P(N)) >e A(A(A(N))) =e 





This chain of sets ordered by <>, begins with the empty set 0 = { }. This set is 
followed by an infinite number of finite sets called the “natural numbers”. Once we 
attain the first infinite set N, an endless sequence of infinite sets can be constructed 
by successively taking powers of a set. Note that no natural number is constructed 
by taking the power set of its immediate predecessor. So the method for constructing 
each natural number from its predecessor is different from the method used to con- 
struct each new infinite set. In fact, it is an axiom that allows N to exist. Another 
axiom allows us to call the power of a set, a “set”. Of course, various chains of sets can 
be constructed in this way, each depending on the choice of the first set. If we started 
with the set of all real numbers, R, we then obtained what initially appeared to be a 
different chain of infinite sets, R >e A(R) >e A(A(R))---. It was then determined 
that R and A(N) are in fact equipotent and so the displayed chain containing power 
sets of N contains copies of the R-related power sets. 
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We thought it would be practical to partition the class of all sets into subclasses of 
mutually equipotent sets. These subclasses were seen to be equivalence classes in- 
duced by the equipotence relation ~e. We defined the notion of ~e-equivalence class 
representatives called cardinal numbers. A cardinal number was declared to be a set 
which represents all sets which are equipotent to it. We had to postulate the ex- 
istence of the cardinal numbers with the promise that once we have developed the 
required set-theoretic tools, the cardinal numbers would be appropriately defined or 
constructed. 


We have seen that the set of all natural numbers has been extremely useful in de- 
termining various properties of countably infinite sets. A critically important tool in 
our study was the principle of mathematical induction over N. Since any countably 
infinite set is a one-to-one image of N, this means that the elements of such sets can 
be indexed by the elements of N. Indexing countable sets in this way allows us to 
linearly order these sets. We can then apply the principle of mathematical induction 
to determine some of their properties. When working with uncountable sets, we do 
not yet have access to uncountable well-ordered sets whose elements can be used to 
index such sets. We will soon see that ZFC provides the necessary ingredients to 
construct “universal indexing sets”. ! 


Well-ordered sets revisited. 


Recall that “order relations” on a set S are relations which fall into two major cate- 
gories: linearly ordered relations or partially ordered relations (also, non-linearly or- 
dered relation). Each of these can be strict or non-strict order relations. Non-strict 
ordered relations are often represented by “<”, while strictly ordered relations are 
often represented by “<” although other symbols may be used. An order relation on 
a set, S, is linear provided provided any two distinct elements of S are comparable 
under the given order relation. That is, all elements of S' can be lined up on a line, 
the “larger” elements normally to the right of (or above) the “smaller” ones. Partially 
ordered classes are often viewed as having many branches where elements on different 
branches are not comparable by < or <. It is often said that non-linear order relations 
have many “chains of elements” (subsets which are linearly ordered) while a linearly 
ordered class has all its elements lined up in a single chain. 


In what follows the hypothesized sets, S, will be linearly ordered by < or <. We 
will be investigating those linearly ordered sets which are “well-ordered”. We remind 
ourselves of what “well-ordered” means: 


A well-ordered set is a set, S, which is linearly ordered by < or < in such 
a way that every non-empty subset, T, of S contains its least element. If a 


1 These sets will be called ordinals (soon to be defined). Cardinal numbers will be defined as being those 
ordinals which satisfy a specific property. Until we formally define “cardinal numbers” we will refrain from 
referring to the notion of “cardinality of a set” in the process that leads to this definition. 
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relation < well-orders a class or a set S' we will sometimes, more succinctly, 
say that “S is <-well-ordered”. 


Well-ordered classes. Note that in the above definition of “well-ordered set”, we can 
replace the word “set” with the word “class” , so that we can speak of an ordered proper 
class, A, which is well-ordered by some relation, <. Proper well-ordered classes will 
be discussed further on in the text. 


Before we start we should recall that the set, N, as well as every natural number, n, 
were shown to be €-well-ordered, (see theorem 14.3 and corollary 14.4). The set, N, 
of all natural numbers and any natural, n, are also C-well-ordered. We will invoke 
this fact to show that given any non-empty countable set S, we can define an order 
relation which well-orders S. 


Theorem 26.1 Let f : T — S be a one-to-one function mapping T onto S. If T isa 
well-ordered set, then T induces a well-ordering on S. Hence, every countable set can be 
well-ordered. 


Proof: 


What we are given: There exists a function f : T — S which maps the well-ordered set 
(T, <r) one-to-one onto the set S. 
What we are required to show: There exists an order relation which well-orders the set S. 


Since f : T — S maps T one-to-one onto the set S, we can then index the elements of S 
as follows: If s = f(n), express s as sn. Then S = {sn : n € T} = f[T]. 
We define the relation “<s” as 


Sn <s Sm if and only if n <r m 


We claim that <s well-orders S: 


— The set S is <g,-linearly ordered: It is clear that since f is one-to-one onto, <s is 
irreflexive and asymmetric. For transitivity, we see that 


Sn <s Sm and Sm <s Sr => N <r mand m <rr 
> Neer 
> Sn <g Sr 


We now verify that every pair of elements in S are comparable under <s. If Sn, Sm € S, 
then n and m are the unique corresponding elements in T. Then n <p morm <r n. 
Hence, either Sn <s Sm Or Sm <s Sn. Hence, all pairs of elements of S are <s- 
comparable and so S is <,-linearly ordered. 
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— The set S is <s-well ordered: Suppose A = {s; : i € U C T} is a non-empty subset of 
S. Then U is a non-empty subset of T. Since T is well-ordered, U has a least element, 
say k. Since k <r i for all i € U, then sk <s si for all s; € A. Thus, A contains a 
least element. 

This proves that the relation, <s, induced on S by T is a well-ordering. 

We now show that every non-empty countable set can be well-ordered. Let S be a count- 

ably infinite set. Then there exists a function, f : N — S, mapping N one-to-one onto S. 

Since N is well-ordered, then S$ has a well-ordering. 

If S is finite and non-empty, then it is the one-to-one image of some natural number 

n (18.7). Since every natural number n is €-well-ordered (14.4), S inherits this well- 

ordering from n as described above. 


We provide a few examples of linearly ordered sets which are well-ordered and some 
that are not (some of which we have seen before). 


a) The set of all even natural numbers, Ne, with the ordering inherited from (N, C) 
is a well-ordered set since every pair of even numbers are comparable and every 
subset of even numbers contains a least even number. 


b) Every natural number, n, is a well-ordered set. For example, 5 = {0, 1, 2,3, 4} is 
€-linear (or C-linear) and every subset of 5 contains a least element. 


c) The set of all countably infinite sequences of natural numbers, NN, equipped 
with the lexicographic ordering! has been shown to be a set which is linearly 
ordered, but not well-ordered, since it contains subsets with no least element. 
For example, suppose that for each i € N, x; = {a; : j € N} where aj = 1 
if j = i and a; = 0 otherwise. Then for each i € N, x; € NN. The subset 
S = {x; : i © N} of NN does not contain a least element since it does not contain 
the element (0,0,0,...,). 


d) The set, N x N, can also be equipped with the lexicographic ordering: 


1 (0,0), (0, 1), (0,2), ..., (1,0), (1,1), .. ., (2,0), (2, 1), (252 C2 By, sap 


When ordered in this way, N x N can be seen as being the union of a countably 
infinite number of copies of N lined up from end to end. This is easily seen to be a 
linear ordering. Given any non-empty subset M = {(s,t):s E S,tET} of NXN, 
the least element of M is (u, v) where u = least{S} and v = least {t : (u, t) € M}. 
The element, (u, v), belongs to M since both u and v are least elements of subsets 
of a well-ordered set. We conclude that the lexicographically ordered N x N is 
well-ordered. 


e) The set, R, of real numbers equipped with the usual real number ordering is 
linear but is not well-ordered since the set {x € R : x > 1} does not have a least 


1See the definition of lexicographic ordering on page 130. 
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element. Note that this does not mean that there isn’t an order relation which 
well-orders the real numbers. 


Our experience with well-ordered sets is quite limited. Even if we can use the lexico- 
graphic ordering tool to construct long chains of copies of N we have however not been 
able to exhibit a single uncountably infinite set which is well-ordered. Anyone who 
has attempted to find a well-ordering relation for R may wonder if an uncountable 
well-ordered set exists at all. 


Initial segments revisited. 


The “Initial segments of a set” is another concept we will be revisiting at this time. 
The reader will recall that the notion of an “initial segment” was introduced in 17.1 
as special kind of subset of Q. In that section, we referred to it as a Dedekind cut. 
The set of all Dedekind cuts became the “real numbers”. Initial segments discussed 
here are the same mathematical objects as the ones discussed in the chapter whose 
purpose was to define the real numbers R. However, the context is quite different. 
In this section the initial segments we will study are subsets of abstract well-ordered 
sets. We will discuss the notion of “initial segments” as though we have never seen 
these before. We start with the following formal definition. 


Definition 26.2 Given a well-ordered set (S, <), a subset U of S satisfying the property 


U#S and Vu E€ U, |æ < u] => [x € U] 


is called an initial segment of S. In this definition, the partial order relation < can be used 
instead of < without altering the meaning of “initial segment”. 


Formal definitions of abstract concepts are often not expressed in a reader-friendly 
form. This is because the reader-friendly form is not always the form that is best 
adapted to the process of proving statements in which carefully formulated definitions 
are required. The following theorem will allow the reader to more easily visualize 
what initial segments in well-ordered sets look like. 


Theorem 26.3 If (S, <) is a well-ordered set, then every initial segment in S is of the form 


Sa = {rE S38 iat 


for some a € S. 


?We will see later on that the Axiom of choice guarantees that R can be well-ordered without explicitly 
stating what such a well-ordering could be. 


256 Section 26: Well-ordered sets. 


Proof: 


What we are given: That (S, <) is a well-ordered set; T is a proper subset of S satisfying 
the property “Vt € T, [x < t| > [z € TJ”. 

What we are required to show: That T = Sa = {x E€ S : x < a} for some a € S. 

Since T is a proper subset of S, then S — T is non-empty. So S— T must contain its least 
element, say a (since S is well-ordered). 

Claim Sa C T: Since a is the least element of S- T, x <a =>xgS-T=xeT. So 
Sa C T, as claimed. 

Claim T C Sa: If £ Z Sa then x > a. Then the element x cannot belong to T, for ifa € T, 
a < x would imply that a € T (by definition of the set T); since a € S — T, we would 
obtain a contradiction. So u € T = u < a. That is, T C Sa as claimed. 

So the initial segment, T, of the well-ordered set, S, is the set Sa = {a € S: x < a} where 
a is the least element in S — T, as required. 


Given the initial segment Sa, we will refer to a as the leader of the initial segment. 
The leader, a, of the initial segment, Sa, is not an element of the initial segment. It is 
also important to remember that, by definition, a well-ordered set S is not an initial 
segment of itself. We provide a few examples of sets which are initial segments and 
sets which are not: 


a) Note that every natural number n in N is an initial segment of N. For example, 
5 = {0,1,2,3,4} = {n €N:n < 5} = S; is an initial segment of N. 
— Wecan view 5 = S5 = {0, 1, 2,3, 4} as being C-well-ordered. The initial segments 
of 5 are the following sets only: 


Mt, ay Bea o ar 
SS Be =O, D1 
oS R fo.) 

1 = Sı = {0} 


b) Even though the set Ne of all even natural numbers is a proper subset of N it is 
not an initial segment of N since 26 € Ne and 17 < 26 but 17 Z Ne. However, 
S26 = {n E€ Ne : n < 26} is an initial segment of the well-ordered set (Ne C). 

c) The subset S = {0,2,3,4,5,...,} in N is not an initial segment of N since 3 € S 
and 1 < 3 but 1 does not belong to S. 

d) Consider the set, Nt-12} = {{ao, a1, a2} : a; € N}, of all functions mapping {0, 1, 2} 
into N, ordered lexicographically.! This set is easily verified to be well-ordered.” 


Note that this set is the set of all ordered triples of natural numbers and so is equivalent to the Cartesian 
product Nx Nx N. 
If S is a non-empty subset of NOO125 Jet (bo, b1, b2) be the element in S such that bo, b1, and b2 is the 
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The set S{o,1,0} = {{0, 0, 7} : ¿ € N} is an initial segment of N(0-1,2}. Tt is the set of 
all elements in Nt®12} which are strictly less than {0, 1, 0}. 


Initial segments of well-ordered sets are well-ordered. If Sq is an initial segment of a 
<-well-ordered set S, then Sa can inherit the order relation “<” from S so that it can 
itself be viewed as a <-well-ordered set. 


“Order isomorphisms” between well-ordered classes. 


Given two sets A and B there can be many functions mapping A into B. We may 
want to classify these functions by “types”. For example, we may want to consider 
only those functions f : A — B which are one-to-one, or only those functions which 
are constant, or only those with finite range, and so on. If we are given two lin- 
early ordered sets (S, <s) and (T, <r) we may be interested only in those functions 
f: S — T which “respect the order” of these functions. By “respecting the order” we 
mean that n <s m => f(n) <r f(m). For example, the function f : N — N defined as 
f(n) = 5n respects the order of the elements of the set N (for example, 3 < 4 where 
f(3) = 15 < 20 = f(4)) while the function, g : (0, 1] — N, defined as, g(x) = 1/z, does 
not (since 1/3 < 1/2 and yet g(1/3) = 3 £ 2 = g(1/2)). One-to-one order-respecting 
functions between well-ordered sets will be called order isomorphism. We begin by 
formally defining this concept. 


Definition 26.4 Let f : (S,<,) — (T,<,) be a function mapping a well-ordered class, 
(S,<,), onto a well-ordered class, (T, <p). Note that the symbols <, and <, will allow us 
to distinguish between the order relations applied to the sets S and T, respectively. 


a) We will say that the function, f, is increasing on (S,<,) if 


(£ <s y) > (f=) <r fF) 


b) We will say that the function f is strictly increasing on (S, <) if 


(£ <s y) > (f=) <r FY) 


A strictly increasing function must be one-to-one. 


c) If f : (S, <5) — (T, <+) is strictly increasing, then f is said to be an order isomor- 


En 


phism mapping S into T. 


least element of all first, second and third coordinates of elements in S respectively. Let (x,y,z) € S. If 
bo < x, then (bo, b1, b2) < (x,y,z); if bo = x and bı < y, again, (bo, b1,b2) < (x,y,z); if bo = x and bı = y 
since be < y, then (bo, b1, b2) < (x,y,z). So (bo, b1, b2) is the least element of S. 
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If there exists an onto order isomorphism between the two well-ordered classes, (S,<,) and 
(T, <p), we will say that the classes are order isomorphic, or that a function maps S order 


isomorphically onto T. 


We provide a few examples of order isomorphisms between well-ordered sets intro- 
duced in previous chapters. 


— Let (Ne, <) denote the even natural numbers equipped with the standard natural 
number ordering <. Since the function f : N — Ne defined as f(n) = 2n is 
one-to-one and strictly increasing, then it maps N order isomorphically onto Ne. 

— On the other hand the function g : (N, <) — (N, <) defined as g(n) = n+ (—1)” is 
one-to-one and onto (N, <) but is not an order isomorphism. If g(n) = an, witness 
ao = 1, a, = 0, ag = 3,a3 = 2,.... We see that g does not respect the order of the 
elements. 

— Consider the set N x N x N = { (ao, a1, a2) : a; E N} ordered lexicographically. We 
see that the set 








S(0,1,0) = {(0, 0, i) 1E N} 


is an initial segment of N x N x N since (0,0,2) < (0,1,0) for all natural numbers 
i. Verify that the function f : N — N x N x N defined as f(n) = (0,0,n) maps N 
order isomorphically onto S(0,1,0). 

Does there exist some other order isomorphism which maps N onto S(o,1,0)? (A 
statement proven in the proposition below will confirm that there can be no other.) 


— Suppose <* orders the elements of N as follows: 
- If n is even and m is odd, then n <* m. 


- If n and m are both even or both odd, then n and m respect the usual order 
of natural numbers. That is, 


(N <*) = {0,2,4,6,...,1,3,5,7,...} 


Then the function f : N —> (N, <*) defined as f(n) = 2n, maps N order-isomorphically 
onto the initial segment {0,2,4,6,...,} of (N, <*). 


26.5 Basic properties of order isomorphisms. 


We now list and prove a few basic properties of order isomorphisms. We will refer to 
these often in our study of those sets we will call ordinals. 


Proposition 26.5 Let (S, <s) and (T, <r) be a well-ordered sets. 


a) The inverse of an order isomorphism is an order isomorphism. 
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b) 


If f : (S,<g) — (S,<s) is a strictly increasing function mapping S into itself, then 
x < f(x), forallae S. 


The set S cannot be order isomorphic to an initial segment of itself. 


If f : (S, <s) — (S, <s) is an order isomorphism mapping S onto itself, then f is the 
identity function.! 


If f : (S,<s) — (T,<r) and g : (S,<s) — (T,<r) are two order isomorphisms 
mapping S onto T, then f =g. 


f) Suppose f : (S, <s) — (T, <r) is an order isomorphism mapping S onto an initial 
segment of T. Then S and T cannot be order isomorphic. 
Proof: 


a) 


Given: That f : S — T is an order isomorphism mapping the well-ordered set (S, <s) 
onto (T, <r). 

What we are required to show: That f~! : T — S must also be an order isomorphism: 
To see this, let u,v be elements in T such that u <r v. Since f is one-to-one 
and onto, there exists distinct elements a = f~!(u) and b = f7!(v) in S. Since 
S is well-ordered, it is linear and so all elements in S are comparable. So either 
f(u) =a<g b= f-t(v) or ft (v) =b <g a = f7! (u). Ifb <s a, then, since f is or- 
der preserving, f(b) = v <p u = f(a), a contradiction. Soa = f~t (u) <s f-t(v) =b. 
So f~t : T — S must also be an order isomorphism. 


What we are given: That (S,<) is well-ordered, that f : S — S, and that x < y 
implies f(x) < f(y) (that is, f is strictly increasing). 

What we are required to prove: That x < f(x) for all x. That is, f cannot map an 
element x “below itself”. 


Suppose there exists an element x of S such that f(x) < x. Then, the set 
T = {x € S : f(x) < x} is non-empty. We claim that this will lead to a contradiction: 


— Since S is well-ordered, T must contain a least element, say a. Since a € T, 
f(a) <a. 


— Since f is strictly increasing, 


fla) < a= f(f(a)) < fla) 


— By definition of T, f(f(a)) < f(a) implies f(a) € T. Since a is the least element 
of T,a < f(a). But a < f(a) and f(a) < a are contradictory statements. The 
source of this contradiction is supposing that T 4 Ø. 


We must conclude that T = Ø. That is, for alla € S, x < f(x). 


lAn order isomorphism from an ordered set onto itself is called an order automorphism. Here we are 
stating that the only automorphism is the identity function. 
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What we are given: That (S, <) is a well-ordered set. 

What we are required to show: That S cannot be order isomorphic to an initial seg- 
ment of itself. 

An initial segment of S must be of the form Sa = {x E S: x < a} where a E S. 
If f: (S,<) — Sa is an order isomorphism onto Sa, then f must map a to some 
element f(a) in Sa; this means f(a) < a. By definition of order isomorphism, f is 
strictly increasing on S and so f(f(a)) < f(a). But, by part b) above, x < f(x) for 
all x € S and so f(a) < f(f(a)). The statements f(f(a)) < f(a) and f(a) < f(f(a)) 


are contradictory. So no initial segment of S can be the order isomorphic image of S. 


What we are given: That f : (S,<) — (S, <) is an onto order isomorphism. 

What we are required to show: That f(a) =x for alla € S. 

If f is an order isomorphism from S onto itself, then both f and f~! must be strictly 
increasing functions. By part b) above, x < f(x) and x < f~1(z), for all x € S. 
Suppose s < f(s) for some s € S. Then 


f(s) < F(s) = 8 


The statements f~!(s) < s and s < f~!(s) are contradictory. So there can be no 
element s € S such that s < f(s). Then x < f(x) and x £ f(x) forces f(x) = x for 
all z €s. 


Suppose f : (S, <s) — (T, <r) and g : (S, <s) — (T, <r) are two order isomorphisms 
mapping S onto T. Then f~t : T — S is an order isomorphism and so the function 


fog : (S, <s) > (S, <s) 


is an order isomorphism of S onto itself. By part d) f~!.g must be the identity map. 
Then g = (f1)! = f. 

What we are given: f : (S, <s) — (T, <r) is an order isomorphism mapping S onto 
an initial segment T, of T. 

What we are required to show: That S and T cannot be order isomorphic. 

If S and T, are order isomorphic and S and T are order isomorphic, then T is order 
isomorphic to Ty contradicting the statement in part c). So S and T cannot be order 
isomorphic. 


We highlight some important points that are made in the above statements. 


Firstly, if f : S — T is an order isomorphism between well-ordered sets S and T, 
then there can be no other one. This is important, since it points to a crucial differ- 
ence between equipotent sets and order isomorphic sets. There can be many different 
functions which map a set S one-to-one onto a set T. But if (S,<,) and (T,<,,) are 
known to be order isomorphic, then only one order isomorphism can bear witness to 
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this fact.1 We might say that an order-isomorphism is “sensitive” to the structure of 
a well-ordered set, while equipotence is not. For example, the equipotence relation 
perceives ({0, 1} x N, <iex) simply as a countable set allowing for many ways of map- 
ping N one-to-one and onto this set, while an order isomorphism is sensitive to the 
fact that this set is made of two copies of N lined up one after the other and so cannot 
view this set as a single copy of N. 


Secondly, two initial segments of the same well-ordered set are order isomorphic only 
if they are equal. 


Thirdly, a well-ordered set can never be order isomorphic to an initial segment of 
itself. This again underlines an important difference with the equipotence relation. 
By definition, an infinite set is precisely a set which is equipotent with a proper subset 
of itself. 


Ranking well-ordered sets with order isomorphisms. 


We will show how order isomorphisms can be used to “rank” well-ordered sets. We 
begin by introducing the following notation. 


Notation 26.6 Let S and T be two well-ordered sets. Then the expression 


S wo T 


means “S and T are order isomorphic”. The expression 


S <wo T 


means “S ~wo Ta” where Ta is some initial segment of T. The expression 


S <wo T 


means “S ~wo T or S <wo T”. 


If W ={S E€ Z : S is well-ordered} denotes the class of all well-ordered sets, see that 
the relation ~wo is reflexive, symmetric and transitive on W and so is an equivalence 
relation. See that <wo is also reflexive and transitive on W. The relation <wo is 
not antisymmetric on W in the usual sense, since S <wo T and T <wo S implies 


'Note that even if the order isomorphism between two initial segments is unique, it is still entirely 
possible for an initial segment to be mapped order-isomorphically onto another subset of a well-ordered set. 
For example, the initial segment {0, 1, 2,3} can be mapped order-isomorphically to the non-initial-segment 
A = {11, 12,13, 14}. But note that A is not an initial segment of N. 
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S ~wo T, not S = T.! But <wo can always be used as a ranking too for the elements 
of W. We will show that any two well-ordered sets are <wo-comparable. That is, 
given any two well-ordered sets, S and T, either S <wo T or T <wo S. The reader 
should carefully note how the “well-order properties” are used in various parts of the 
proof. 


Theorem 26.7 Let (5,<,) and (T, <,) be two well-ordered sets. Then either S <wo T 
or T <wo S. 


Proof: 


What we are given: Two well-ordered sets (S, <) and (T, <). The expression Sa rep- 
resents the initial segment {x € S : x < a} whose leader is a. 

We are required to show that: S <wo T or T <wo S 

The symbol Sg ~wo Tp is to be interpreted as “the initial segment Sa of S is order iso- 
morphic to the initial segment Tp of T”. 


We define the function f : S — T as follows: f(a) = b if and only if Sg ~wo Th. We will 
carefully examine this function and describe its properties. 


— We verify that f is well-defined: If f(a) = b and f(a) = c, then Sa ~wo Te and 
Sa ~wo Te. This implies Ty ~wo Te. Two initial segments of the same well-ordered set 
are order isomorphic if and only if they are equal. Then b = c. 


— We verify that the domain of f is non-empty: Suppose Os and Or denote the least 
elements of S and T respectively. If 1g and 1p denote the least element in S — {05} 
and T — {0r}, respectively, then Sig ~wo Tıp. Then f(1s) = 1r so the domain of f 
contains at least the element 1g. Let D denote the domain of f. 


— We verify that f is strictly increasing: Suppose a and 6 are in the domain D of f 
such that a <, b. If f(a) = c and f(b) = d, then Sa ~wo Te and Sp ~wo Ta. Then 
Te ~wo Sa C Sb ~wo Ta. So Te is order isomorphic to an initial segment of Ty. This 
implies Te C Ta > c < d. So f is strictly increasing on D. 


If D = S, then f maps S order isomorphically into T (since f has been shown to be 
strictly increasing on D). 
It follows that D = S <wo T, and we are done. So let us suppose that D # S. 


— We claim that the domain D is an initial segment of S: If u € D, then Su ~wo Tk for 
some k € T. That is, there exist an order isomorphism g : Su — Tk. If £ < u, then 
Sz C Su and gis, : Sz — Tk maps Sy onto an initial segment, say Ty, in Tk. Then 
Sz ~wo Ti implies f(x) =t. Then z € D. So D is an initial segment of S as claimed. 


‘However, if W* = {[S]wo : S € W} denotes the class of all equivalence classes induced by the equivalence 
relation, ~wo, on WY, then the statement in the theorem will allow us to conclude that <wo induces a linear 
ordering on #*. 
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— We claim that if f[D] AT, then f[D] is an initial segment of T : Let v € f[D]. Then 
there exists an element a € D such that f(a) = v. This implies that Sa ~wo Ty. Let 
u<vinT. Then T, C Ty. Since Sg ~wo Ty, then T, is order isomorphic to an initial 
segment Sp C Sa, for some b € D. Then f(b) =u. So u € f[D]. Since f[D] 4 T, by 
definition, f[D] is an initial segment of T as claimed. 


— We claim that f|D| = T: Suppose not. Recall that D is an initial segment of S and 
so there exists q € S such that D = S4. Then, as we just showed, f[D] = f[S | must 
be equal to an initial segment, say T,, for some r € T. Then, since f is an order 
isomorphism, Sz ~wo T;. This means that f(q) =r. So q € D. But this contradicts 
the fact that D = Sq = {x E S: x < q}. We must conclude that f[D] = T as claimed. 


We have thus shown that either f maps S order isomorphically into T or f order iso- 
morphically maps an initial segment D of S onto T. From this we conclude that for any 
well-ordered sets (S, <s) and (T, <r), either S <wo T or T <wo S. 


The above theorem states that if we gather all well-ordered sets together to form a 
class of sets, we can rank them with the relation <wo. Note that in this class, dis- 
tinct well-ordered sets may well be equal or equipotent sets. For example, the set 
(N*, <) = {0,2,4,...,1,3,5,7,..., } of all natural numbers where the even numbers 
are first enumerated in the usual order followed by all odd numbers enumerated in the 
usual way, is simply another way of describing the set N. But, nevertheless, N <wo N*. 
Even though N and N* are the same set, they are not order isomorphic. On the other 
hand, we easily see that N* and the lexicographically ordered set {0,1} x N are order 
isomorphic. 


It is also interesting to note that every well-ordered set is an initial segment of another 
well-ordered set. Indeed, if S is well-ordered by <, then S' is order isomorphic to the 
initial segment {1} x S of the lexicographically ordered set {1,2} x S. In relation to 
such lexicographically ordered sets we present the following more general result. 


Proposition 26.8 For every natural number n, the lexicographically ordered set S = 
{1,2,...,n} x N is well-ordered. 


Proof: 
Let T be a non-empty subset of S. Let u be the least element of the set {r € {1,2,...,n}: 
(r,t) € S}. Since every natural number is well-ordered (14.4) such a number u exists. 
Let v be the least number in {t € N : (u,t) € S}. Since N is well-ordered (14.3) such a 


number v exists. Then (u,v) < (i, j) for all (i, j) € A. Hence, every non-empty subset A 
of S has a least element. So S' is <-well-ordered. 


Concepts review: 


264 Section 26: Well-ordered sets. 
1. What is a well-ordered set? 
2. What is an initial segment of a well-ordered set? 
3. Is a well-ordered set an initial segment of itself? 
4. Give three examples of well-ordered sets. 
5. Is the lexicographic ordering of N x N a well-ordering? Why or why not. 
6. List all initial segments of the natural number 7. 
7. Give an infinite initial segment of Nt912}. 
8. What is an order isomorphism between two well-ordered sets? 
9. Can a well-ordered set be order isomorphic to one of its initial segments? 
10. If f : S — T where (S, <s) and (T, <r) are linearly ordered sets, what does it mean 
to say that f is strictly increasing? 
11. If a well-ordered set S is order isomorphic to an initial segment of a well-ordered set 
T, can S and T be order isomorphic? 
12. What can we say about two well-ordered sets S and T in reference to order isomor- 
phism? 
13. How many order isomorphisms are there between an initial segment and itself? 
14. If S and T are order isomorphic sets and f and g are two order isomorphisms mapping 
S onto T, what can we say about f and g? 
EXERCISES 


Let S = {0}U { 


1. Is the set of all prime numbers ordered in the usual way a well-ordered set? 
. List the first three initial segments of the set of all prime numbers ordered in the 


usual way. Are initial segments of prime numbers initial segments of N? 


-H : n € N} be ordered by < in the usual way. Is the set S a 
well-ordered set? Justify. 


. Let (S, <) be a well-ordered set. We say that an element b is an immediate successor 


of a if there does not exist an element c such that a < c < b. Show that if S isa 
well-ordered set, then every element of S that is not the maximal element of the set 
must have an immediate successor. 
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5. Suppose (S, <) is a well-ordered set. Is there an order relation we can define on the 
set {T : T is an initial segment of S} which will make it a well-ordered set? 
6. Let No denote the set of all odd natural numbers ordered in the usual way. Are No 
and N order isomorphic? If so say why. If not explain why. 
7. Are the sets of all prime numbers and N both ordered in the usual way order iso- 
morphic? If so say why. If not explain why. 
8. Consider the set {1,2} x N when ordered lexicographically. 
a) List the first few elements of {1,2} x N. 
b. Show that {1,2} x N ordered lexicographically is a well-ordered set. 
b) List three finite initial segments of {1,2} x N. List three infinite initial segments 
of {1,2} x N. 
c) In how many ways (if any) can we map N order isomorphically onto an initial 
segment of {1,2} x N? 
d) In how many ways (if any) can we map N order isomorphically onto {1,2} x N? 


9. Suppose S is a countably infinite set. Show that there exists a well-ordering <s such 
that (S,<,) and ({0,1} x N, <iex) are order isomorphic. 
10. Let S = {1 — “FT : n E€ N} U {1} be ordered by < in the usual way. 
a) Is the set S a well-ordered set? 


b) Are the sets S and N order isomorphic? If so, show why. If not explain why not. 
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27 / Ordinal numbers: Definition and properties. 


27.1 


Summary. In this section we provide some motivation for constructing what 
will be called the “ordinal numbers”. We then formally define these. We see that 
N and all of its elements are ordinal numbers. When N is viewed as an ordinal 
number it is represented by w. We review the notions of “transitive sets” and 
“E-well-ordering”. We then define “the immediate successor” of an element of 
a linearly ordered set, and show that the immediate successor of an ordinal num- 
ber is an ordinal number. We also exhibit an ordinal number successor formula, 
at = aU {a}, and show how it is used to recursively construct sets of ordi- 
nals. We then prove a few basic properties of ordinal numbers from which we 
deduce that all pairs of distinct ordinals are €-comparable. We define “limit or- 
dinals”, show how these are constructed and provide methods to recognize them. 


Introduction. 


Our study of infinite sets began with a declaration of what it means for a set to be 
infinite. We stated that only a set S “which can be mapped one-to-one onto a proper 
subset of itself” is referred to as being “infinite”. All other sets are “finite” sets. So 
initially, sets were either finite or infinite. Then we discovered that infinite sets could 
be subdivided into two categories: Those which are one-to-one images of N — referred 
to as “countably infinite” sets — and those that are not — referred to as being “un- 
countably infinite”. Then, we discovered that the class of uncountably infinite sets 
actually has a more complicated structure. We found that not all uncountable sets 
were pairwise equipotent. We were led to this conclusion when we proved that no 
infinite set S could be mapped one-to-one onto its power set A(S). This implied 
that we could partition the class of all sets into infinitely many subclasses of sets each 
containing sets which were pairwise equipotent sets. Up to now, our attention has 
mainly been centered on investigating the properties of those sets which belong to the 
class of all countably infinite sets and the class of all sets which are equipotent to R 
(since the sets N and R are the two sets we are the most familiar with). 


Within the class of all sets, we investigated the subclass of all well-ordered sets. 
We saw that order isomorphisms allow us to refine even further our classification of 
infinite sets. For example, a class of all well-ordered sets can be further partitioned 
into subclasses of pairwise order isomorphic well-ordered sets. Recall that an “order 
isomorphism” between two well-ordered sets S and T is a one-to-one function which 
respects the order of the elements in the domain S and the image T of S. That is, 
the order of the elements of the domain and the image is preserved by the one-to-one 
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function. For convenience, we introduced the following notation: 


S~woT & “S and T are order isomorphic” 
S <w T & S~wo W = an initial segment of T 
S <wo T Ss “S$ ~wo Tors <wo T 


We were able to show that all pairs of well-ordered set are <ywo-comparable. This is 
in striking contrast with our first attempts at grasping the structure of the class of all 
sets. The reader will recall that we were not clear on how to prove that “~~,” linearly 
orders the class of all sets, even though we strongly suspect this to be the case. Our 
ultimate objective in this section and the one that follows will be to construct a “well 
ordered class of well-ordered sets” which contains an order isomorphic copy of every 
well-ordered set. We will see that ZFC provides us with the tools to construct a class of 
sets which serves this purpose. The elements of this class of sets will be called ordinals. 


Definition of “ordinal number”. 


The reader will recall that every natural number is a “transitive set”. Transitive sets 
are those sets S that satisfy the rule: 


(ye S)=>(yCcS) 
This property was shown (in 13.7) to be equivalent to the property 
zxeyandyeSSuaes 


which is more suggestive of the notion of “transitivity” with respect to the membership 
order relation €. A proper class which satisfies this transitive property will be referred 
to as a transitive class. We showed (in 13.8) that not only is N a transitive set, but 
each natural number is also transitive (13.9). This is easy to see if we re-examine how 
the natural numbers are constructed: 





ð = 0 
øt = 0 u {0}={0}=1 

Py = 1U {1}={0,1}=2 

Fc DUF =y 

3t = 3 U {3}= {0,1,2,3}=4 

n™ = nU{n}= {0,1,2,...,n}=n+1 


So n+ 1 = {0,1,2,3,...,n} C N is both a subset of N and an element of N contained 
in the subset n + 2 = {0, 1, 2,3,... n +1} CN. 
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The “transitive set” property of a set S does not depend on any particular order 
relation on S. But if every element of a set S is transitive, it makes it possible for 
€ to take on the role of an order relation on S. It is shown in theorems 14.4 and 
14.3 that all natural numbers n, as well as the set N, are strictly €-well-ordered. To 
say that “N is strictly €-well-ordered” means that N is €-irreflexive, €-asymmetric, 
€-transitive, any two distinct elements are €-comparable, and every non-empty subset 
S of N contains a least element x with respect to €. This means that any non-empty 
set, S, of natural numbers contains an element x such that x € y for all y € S. We 
will now discuss €-well-ordered sets which are not necessarily natural numbers. 


Definition 27.1 Let S be a set. If S satisfies the two properties, 


1) S is a transitive set, 
2) S is strictly €-well-ordered 


then S is called an ordinal number. 


The set N and all its elements are ordinals. Since the set N of all natural numbers, 
as well as each natural number, have been shown (in 13.8, 13.9 14.4 and 14.3) to 
be strictly €-well-ordered transitive sets, the class of all ordinals contains infinitely 
many finite ordinals and at least one infinite ordinal, namely N. We will continue to 
represent finite ordinal numbers by the usual lower case letters such as m or n, but 
infinite ordinal numbers will be represented by lower-case Greek letters, such as w, @ 


and £. 


Notation 27.2 When viewed as an ordinal number, N will be represented by the lower-case 
Greek letter w.! We then write, 


27.3 


w = AO 1,2,3, ..., } 


The reader should note that, by definition, only “sets” can be ordinals. That is, a 
strictly €-well-ordered proper class is not an ordinal. 


Constructing new ordinals from known ordinals. 


Remember that each natural number is constructed using what we referred to as a 
“successor constructing algorithm” n +1 = nt = nU {n}. We will use the same 
mechanism to construct numbers beyond w. 


1The letter w is read “omega”. So N has three representations: When simply viewed as a set, we use 
N, when viewed as a cardinal number we use No, when viewed as an ordinal number we use w. Later, the 
symbol, wo, will be used in instead of w. 
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We first show that w Z w: If N € N, then, by definition of N, N is a natural number 
n. But N cannot be a natural number n for if it was, then n € n, contradicting n ¢ n 
proven in 13.9. Sow ¢w. 
Since w is a set, the expression 

wt =wU {w} 


is the union of two sets and so is itself a set. Note that wt Æ w for if it was, then 
w € w, a contradiction. So, from w, we have generated a new set wt. This set is 
represented as, wt = w + 1. If we repeat the procedure again starting with w + 1 we 
obtain w + 2 = (w + 1) =w+1U{w+1}. We confirm immediately that if œ is an 
ordinal number, then a + 1 is necessarily an ordinal number. 


Theorem 27.3 If a is an ordinal number, then so is its successor, at = a U {a}. 
Proof: 


What we are given: That a is an ordinal number (i.e., œ is a transitive set and strictly 
€-well-ordered). 
What we are required to prove: That at is an ordinal number. 


The class at is a set: Note that since a is an ordinal a must be a set. Hence, by Axiom 
3 (Axiom of pair), {a} is a set. By Axiom 6 (Axiom of union), at = aU {a} is a set, as 
claimed. 

The set at is transitive: Suppose x € at = aU{a}. By definition of “transitive” it suffices 
to show that x C at. If x = a, then x C aU {a} = at and we are done. Suppose x # a; 
then x € a. Since a is transitive x C a and so x C at. So at is transitive. It follows that 
when viewed as a relation on a™, € is a transitive relation. 


The elements of the set at are €-comparable: Let x and y be distinct elements in 
at =aU {a}. 

Case 1: If x =a, then y € x (since x £ y). Then x and y are €-comparable. 

Case 2: If both «,y € a, then, since a is known to be €-linearly ordered, either x € y or 
y € x. So all pairs of elements in a* are €-comparable. 

It follows that the relation “€” linearly orders aT. 


The relation € is a strict linear ordering of at: Since € strictly orders a, x ¢ x for all 
x E€ œ. Also a ¢ a, for if a = x € q, then x € x contradicting the fact that € strictly 
orders a. 


The set at is €-well-ordered: Let S be a non-empty subset of at. Let T= S Na. 

Case 1: If T = Ø, then S = {a}. Since a ¢ a, a must be the least (actually the only) 
element of S. 

Case 2: Suppose T # Ø. Since a@ is €-well-ordered, there exists an m € T which is the 
€-least element of T. If S = T, then m is the €-least element of S, as required. If, on the 
other hand, S = T U {a}, since a is the maximal element in at, m < a. Again m is the 
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€-least element of S. 


We conclude that a* is strictly €-well-ordered. So a* is an ordinal number. 


Definition 27.4 Suppose the set, S, is <-ordered. We say that an element y in S is an 
immediate successor of the element x if x < y and there does not exist any element z in S 
such that x < z < y. We say that x is an immediate predecessor of y if y is an immediate 
successor of x. 


If a is an ordinal number, then we naturally expect at to be an immediate successor 
of a. We verify that this is indeed the case. Suppose there exists an element, 8, which 
is “strictly in between a and a*” with respect to the €-ordering. That is, suppose 
ae Beauv {a}. Since 8 4a and 8 Za, “B € aU {a}” is impossible. So at is an 
immediate successor with respect to the €-well-ordering. 


The ordinal constructing mechanism, at = aU {a}, can now be used to construct 
infinitely many ordinals beyond w. 


oe = ARES att 
w+1 = wT = 101, 2.85, ..., w} 
w+2 — (w +1)" = {0,1,2,3, ...,w, w+ 1} 


w+3 = (w+2)t ={0,1,2,3, ...,w, w+1,w +2} 
(w+n)+1 = (wtn)t=nv {n} = {0,1,2,3, ..., w+ n} 


We see that this method for constructing ordinals has a limited range. Are there any 
other transitive “€-well-ordered sets” beyond the set {w,w+1,w+2,w+3,...} of 
ordinals? Our experience with ordinals tells us that there can be. Recall that having 
defined all finite ordinals (natural numbers) 0,1,2,3,..., we gathered together all 
natural numbers to form a new set, N = w = {0, 1, 2,3,...}. We then explicitly proved 
that this new infinite set, w, is itself an ordinal. This illustrates that the “immediate 
successor constructing algorithm” is not the only way to construct ordinals. Consider, 
for example, the set 


w+w = {0,1,2,3,...,w,w+1lw+2,w+3,...} 


obtained by gathering together the ordinals 0,1, 2,3,...,w,w+1,w+2,w+3,...,. Is 
w +w an ordinal? We will soon show that it is. First, we must show how the notions 
of “ordinal” and “initial segment of ordinals” are different ways of describing the same 
object. 
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27.4 


Ordinals viewed as initial segments. 


If the ordinal numbers look familiar to us it is because their properties are generaliza- 
tions of those possessed by the natural numbers. Parts of the proofs of the statements 
that follow mimic the proofs of various properties of the elements of N. Before we 
begin, we remind ourselves of what those subsets called “initial segments” are: A set, 
U, is an initial segment of an ordered set, (S, <), if and only if U is a proper subset 
of S, and 

Yueu, [w< u]= [veU] 


Theorem 27.5 Let a be an ordinal number greater than zero. 


a) 
b) 


c) 
d) 


Every element of œ is an initial segment of a. 


The ordinal a is an initial segment of an ordinal which contains a, namely, at = 
aU {a}. 


Every initial segment of an ordinal a is an ordinal number. 


Every element of the ordinal a is an ordinal number. 


Proof: 


a) 


What we are given: That x is an element of the ordinal a. 
What we are required to show: That x is an initial segment of a. 


Since x € a, and a is strictly €-well-ordered, then x 4 a. We confirm that x is a 
proper subset of a : Since x € a and a is a transitive set, then x C a. 

Let u € x and suppose v € u. We are required to show that v € x. Given that € 
linearly orders a, then € is transitive, and so v Eu E£ Sven. 


We have shown that x is a proper subset of œ which satisfies the “initial segment” 
property with respect to €, as required. 


We have shown that if a is an ordinal, then so is at = aU {a}. Since a € aT, then 
by part a) a is an initial segment of at. 


What we are given: x is an initial segment in a where a is an ordinal number. 
What we are required to show: x is an ordinal number. 


We claim that x is a transitive set: Let z E€ y E€ x. It suffices to show that z € x. 
Now y € x C a implies y € a. Also z € y € a implies z € a (since a is transi- 
tive). So z is €-less than y, with respect to a’s order relation €. Since x is an initial 
segment, z is €-less than y and y € x implies z € x. So x is a transitive set as claimed. 
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The relation, €, is a strict linear ordering of x : All elements of x are elements of a 
(since  C a) so x inherits from a all €-ordering properties, including €-transitivity 
and €-linearity. So x is €-linearly ordered. Since u ¢ u for all u in a, then this must 
be the case for all elements in x. So € is a strict linear ordering of x. 


The set x is €-well-ordered : Let T be a non-empty subset of x. We are required to 
show that T contains an €-least element. 

Case 1: If T = x, then, since x is an initial segment, T contains the €-least ordinal, 
0, and so we are done. 

Case 2: Suppose T C x. Since a is €-well-ordered T = T N x contains its €-least 
element, say y. Since y E TAN x, y € x. So y is an element of x which is the €-least 
element of T. So x is €-well-ordered. 

So x is a transitive strictly €-well-ordered set. We conclude that x is an ordinal num- 
ber. 


d) If y is any element of the ordinal a, then by part a) y is an initial segment of a. 
Having shown in part b) that initial segments of ordinals are ordinals, then y is itself 
an ordinal. 


We now show that any infinite ordinal (other than w itself) contains w. 


Proposition 27.6 Any infinite ordinal not equal to w contains w. 
Proof: 


What we are given: q is an infinite ordinal number. 

What we are required to show: w € a. 

We claim that {n:n Ew} Ca: 

We prove the claim by induction. Let P(n) be the statement “The natural number n 
belongs to a”. 

Base case: We are required to show that P(0) holds true. Since a is infinite it is non- 
empty. Let y be the €-least ordinal in a. If y = 0, then we are done. Suppose y Æ 0. 
That is, suppose y is non-empty. Then, when viewed as a subset of a, it contains a least 
element x. We then see that x € y € a contradicting the fact that y is the €-least ele- 
ment of a. The source of the contradiction is our supposition that y is not zero. Hence, 
y=0€a. So P(0) holds true. 

Inductive hypothesis: Suppose P(n) holds true for some natural number n. That is sup- 
pose that n = {0,1,2,3,...,n—1} € a. Since a is transitive, n = {0,1,2,3,...,n—1} Ca. 
Then n+1 = {0,1,2,3,...,n—1,n} =nU{n} Ca. Since a is infinite, we actually have 
n+1ca. Sincen+1= {0,1,2,3,...,n—1,n} is an initial segment of a it is an ordinal 
in a. Then P(n*) holds true. 
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By the principle of mathematical induction, a contains every natural number, as claimed. 


Since w Æ a, then w is an initial segment of a. So w € a as required. 


Proposition 27.7 Let a and 8 be distinct ordinal numbers. If a C 8, then a € 2. 
Proof: 


What we are given: That a and p are distinct ordinal numbers. 
What we are required to show: That (a C 8) > (a € 8). 


Suppose a C 8. The set 8 — a is non-empty, and so contains its least element, say y. We 
will show that y = a. If so, then a € 8 and we are done. Since elements of ordinals are 
ordinals, y is an ordinal number. 


We claim that a C g: 


— Let x € a C p. Then z is also an ordinal number. It suffices to show that x € y. 
Suppose x ¢ y. Since ( is €-linearly ordered, x ¢ y implies either y E€ x or y = x 
holds true. But y E€ x C a or y = x C a implies y € a (since a is transitive). This 
contradicts y E€ G—a. So x € y. It follows that œ C y as claimed. 


We claim that a = y: 


— Suppose a C y. Then there exists x € y—a C 8 — œ. This means x is an element in 
B — a which is strictly €-less than its least element y. This contradiction is caused by 
our supposition x € y — a. We conclude that a = y as claimed. 


We have shown that if œ C p, then a is the the least element of 8 — a and so a € 8, as 
required. 


The above results have an implication which is worth pointing out immediately. We 
have shown in theorem 27.5 that for every ordinal a, œa is an initial segment of 8 = 
aU {a} with respect to the € order relation. Then we can write 


y={acp:ac7} 


where the ordinal y is the leader of its initial segment. This is the case, for any ordinal 
y. This is consistent with what we have observed up to now. Witness the ordinal, 
3 = {0,1,2} = {n : n € 3}, where 3 is the leader of the ordinal 3, and the infinite 
ordinal, w = {0,1,2,3,...,} = {n : n € w}, where w is the leader of the ordinal w.! 


‘Unfortunately we cannot deduce from this that the set {a : a is an ordinal, a € w + w}” is an ordinal 
since we have not yet shown that w +w is an ordinal. 
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27.5 The membership relation, €, linearly orders the class of all ordinals. 


Similarities between the methods of construction of the ordinals and the natural num- 
bers strongly suggests that the relation “€” linearly orders the class of ordinal num- 
bers. This remains to be proved. Before we do this, we must prove the following 
lemma. 


Lemma 27.8 If the ordinals a and £ are order isomorphic, then a = £8. 


Proof: 
What we are given: The sets a and @ are ordinals for which there exists an onto order 
isomorphic map f : a —> £. 
What we are required to show: That a = 8. 


Let S = {x E€ a : f(x) # x}. Recall that order isomorphisms are strictly increasing. Since 
0 is the least ordinal of both a and 8, f(0) = 0 (if, for example, f(0) = 1, then f must 
map some element a > 0 to 0 < 1, a contradiction). Hence, S is not all of a. 


If S = Ø, then f(x) = zx for all x in a; then a = 8 and we are done. 
Suppose S # @. We claim that this will lead to a contradiction: 


— Since a is €-well-ordered, S contains a smallest element, say d. Since d € S, then 


f(d) Z d. 
We claim: f(d) C d. 


If x € f(d) in 2, then there exists z € a such that f(z) = x. 


Since f respects €-ordering 


cefd) => f(z) e f(A) 
=> zed 


Since d is the smallest element such that f(z) Ax, z € d > f(z) =z. But f(z) =a. 
Soze€d>azed. 


This shows x € f(d) > x € d. So f(d) C d, as claimed. 


— But we also see that d C f(d), since 


u E€ d= f(u) =ue fld) 
So d C f(d) and f(d) C d implies d = f(d) which contradicts the fact that d is least 
ordinal such that f(d) # d. 


So S must be empty. This means that f is the identity map. We can only conclude that 
a=. 
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Theorem 27.9 The relation “€” linearly orders the class of all ordinals. 


Proof: 


We have shown in the theorem 26.7 that any two well-ordered sets S and T are either 
order isomorphic or one is order isomorphic to an initial segment of the other. If the 
ordinals a and ( are not order isomorphic, then one must contain an order isomorphic 
copy of the other. Suppose, without loss of generality, that œ is order isomorphic to 
an initial segment y of @. By part b) of 27.5, y must be an ordinal number. Since 
qa and y are order isomorphic ordinals, then, by lemma 27.8, they must be the same 
ordinal number. Hence, a € B. We can conclude that any two ordinal numbers are 
€-comparable; so “€” linearly orders the class of all ordinals. 


27.6 


The immediate successor of an ordinal is unique. Since “€” linearly orders the class 
of ordinals, then this restricts the number of immediate successors an ordinal can 
have. For suppose 3; and (2 are immediate successors of the ordinal a. If 6, 4 b2, 
then either 3, € G2 or B2 € Bı. Suppose without loss of generality that 6, € 62. 
Then a € (3; € B2. But this implies that G2 is not an immediate successor of a, a 
contradiction. We must conclude that 81 = B2. So the immediate successor of an 
ordinal is unique. 


Limit ordinals. 


We have seen that all ordinals are initial segments of ordinals. Now an initial segment 
of a linearly ordered set may or may not contain a mazimal element. For example, 
the ordinal number 


w +2 = {0,1,2,...,w,w +1} = {0,1,2,....whU{w+l}=w+1U {w+} 


contains the maximal element, w + 1, since every element of w + 2 is either w + 1 
or is contained in w + 1. On the other hand, the ordinal w = {0,1,2,3,...,} is an 
initial segment {y : y E€ w} of the ordinal w + 1 which has no maximal element. If 
an ordinal 8 has a maximal element, say a, then 8 is the immediate successor of a. 
(Equivalently, the maximal ordinal a is an immediate predecessor of 3.) We can then 
divide the class of all ordinals into two subclasses: 


1) Ordinals 6 that contain a maximal element with respect to €. These are precisely 
the ordinals that have an immediate predecessor. That is, 


B% = {0,1,2,..., 8} 
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2) Ordinals 8 that do not contain a maximal element with respect to €. These are 
the ordinals that do not have an immediate predecessor. They can be represented 
as, 

B={a:ae p} 
For example, w = {0,1,2,...}. 


Those ordinals which do not have a maximal element (equivalently, do not have an 
immediate predecessor) are called “limit ordinals”. We define this formally. 


Definition 27.10 An ordinal a which does not contain a maximal element is called a limit 
ordinal. 


27.7 Constructing limit ordinals. 


Suppose U is a non-empty set whose elements are ordinals. What can we say about 
the union, U{a : a € U}, of all ordinals in the set U? In particular, we ask the 
question: Is the union of all ordinals in U necessarily an ordinal? We will show that it 
must be so. Whether this union of ordinals is a limit ordinal, or a non-limit ordinal, 
will depend on whether the set U contains, or does not contain, a maximal ordinal 
with respect to €. 


Proposition 27.11 If U is a non-empty set of ordinals which contains a maximal element 
B with respect to €, then the union, U{a@: a € U}, is equal to the maximal ordinal, 8, of 
U. 


Proof: 


We are given that 8 is an ordinal which is the maximal element of a set of ordinals U. 
Then a € p, for all a in U which are distinct from 8. Since 8 is an ordinal it is a tran- 
sitive set and so, for all a € U such that a 4 B, ac B. Then Ufa: ace U,a 4B} CB. 
Since GE U, 8 =U{a: ac U}. 


We have shown above that if 3 is the maximal element of U, U{a : œ E€ U} = 8. Hence, 
in such a case, U{a : a E€ U} cannot be equal to U. For example, if U = 3 = {0, 1, 2} 


Ufa:a EU} U{0, 1,2} 
OU1LU2 
ØU {Ø} U {, {S}} 


{2,{}} =24U 
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We will now show that if U is a set of ordinals which contains no maximal element, 
then U{a : a € U} is a limit ordinal which is not contained in U. 


Theorem 27.12 If U is a set of ordinals which does not contain a maximal element with 
respect to “€”, then y = U{a: a € U} is a limit ordinal which is not contained in U. 
Furthermore, y is the €-least ordinal which contains all elements of U. 


Proof: 


What we are given: That U is a set of ordinals with no maximal element and y = U{a : 
aéU}. 

What we are required to prove: That y is a limit ordinal which is not an element of U; 
that y is the least ordinal containing all elements of U. 


The class y is a set. Recall that ordinals are sets (by definition) and, since U is declared 
to be a set, y is the union of a set of sets. Axiom 6 guarantees that y is a set. 


Every pair of elements in y are €-comparable. Any two elements of y are both elements 
of some ordinal and so are themselves ordinals. By theorem 27.8, they are €-comparable. 


The set y is a transitive set. Let 8 be an element of y. Then 3 € a for some a € U. Since 
a is transitive, 8 C a. Since a C y, B C y. Then y is a transitive set, as claimed. 


The set y is a well-ordered set. If A is a non-empty subset of y, AN a Æ Ø, for some 
a € U. Since a is €-well-ordered, A N a contains a least element 8. Let Y be any element 
(equivalently, ordinal) in A not equal to 8. If y € 8, then Yy € a. Then w is an element 
of ANa which is strictly €-less than the least element @ in AUa. Since this cannot be, 
B € p. So B is the least element of A. Then y is €-well-ordered, as required. 

We have thus shown that y = U{a: a € U} is an ordinal. 

The ordinal y = U{a : aœ € U} does not belong to U. Note that a C y for all a € U. Since 
y has been shown to be an ordinal, then for any a € U not equal to y, (a C y) > (a € 7) 
(by Proposition 27.7). Suppose y € U. Then, for all a € U such that a 4 y, a € y. This 
implies that y is a maximal element of U, contradicting our hypothesis stating that U has 
no maximal element. Then y ¢ U, as claimed. 

The ordinal y = Ufa: a E€ U} is a limit ordinal. Suppose not. That is, suppose 
y = BF = BU {8}. Then 8 € y. This means that G € ¢ for some ordinal ¢ € U. 
Equivalently, 6 C ¢ (since ¢ is transitive). Now (¢ € U) = (¢ C y) (we have shown that 
y ¢ U, so we can use strict containment “o C y”). It follows that ¢ C 8. But BC @CB 
implies 8 C 8, a contradiction. The source of this contradiction is our supposition that 
y has an immediate predecessor 3. Then y has no immediate predecessor and so is, by 
definition, a limit ordinal. 

The ordinal y = U{a: a E€ U} is the least ordinal which contains all elements of U. Sup- 
pose a € U. Then a C y. Since y has been shown to be an ordinal, then a € y (by 27.7). 
Then U C y. We claim that y is the least ordinal satisfying this property. Suppose w is 
some ordinal such that Y% € y. Then w € a for some a € U. Then a Z y. Then U Z w. 
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So y is the least ordinal which contains all elements of U, as claimed. 


We summarize. The above theorem now provides us an alternate method for con- 
structing new ordinals from known ordinals. Taking the union of a set U of ordinals 
where U contains no €-maximal element will always produce a limit ordinal which 
does not belong to U. 


Corollary 27.13 Let U be a non-empty set of ordinals which contains no maximal element. 
If U satisfies the “initial segment property”, then U is the limit ordinal U{a : a € U}.! 


Proof: 


What we are given: That U is a set of ordinals which contains no maximal element and 
satisfies the initial segment property. 
What we are required to show: That U = U{a: a € U}. 


We have shown above that the set, U{a : a € U}, is the least ordinal which contains all 
the elements of U. So, certainly, U C U{a : aœ € U}. 

Suppose 3 € U{a:a€ U}. Then 8 € a for some a € U. Since U satisfies the “initial seg- 
ment property”, 8 € U. Then U{a : a € U} C U. We conclude that U = U{a : a € U} 
as required. 


Examples. 


a) We define 
w +w = {0,1,2,3,...,w,w + 1,w +2,w +3,...,} = wU {w +n} o 
We conclude that w + w is a limit ordinal by arguing as follows: 
— Since w +w is the union of two countable sets, it is a set. 
— It is easily seen that w + w contains no maximal element and satisfies the initial 
segment property. 


— By the above corollary, w +w = {a : œ E€ w +w} is the least ordinal which 
contains all finite ordinals and all infinite ordinals of the form w+n where n € N. 
Then w +w is a limit ordinal number. 


b) If we define 
w+w+w=w+wU{w+w, w+w+1, w+w+2, w+w+3,...,} 


we can similarly conclude that w+w +w is the least ordinal which contains all ordinals 
in w +w, and ordinals of the form w +w + n, where n € N. 





‘The set U satisfies the “initial segment property” if Va € U, [y € a)] > [y € U]. 
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27.8 


c) Ordinals such as, w +w, w +w +w and w +w +w +w, are more succinctly written 
as 
w2, w3, w4, w5,... 


and so on. We denote the set of ordinals U{wn : n € N} by 
ww 


We continue constructing, in this way, larger and larger ordinals. Following these 
general principles for representing ordinals we list a few more of these: 


E RO e EE EA Sen eam OE a T a a A A T 
A E E ad Fas 5 to" WS. cate R a he? NT 8s 
2 w 
OTE oc ONE e O ae oO a E ted aa 


Since every one of these is the countable union of countably many ordinals, each is 
countable; so each of these is a set. 


Characterizations of limit ordinals. 


We can also describe limit ordinals in terms of “least upper bounds”. We first remind 
the reader of what is meant by “least upper bound” of an ordered set. 


Suppose T is a non-empty subset of a strictly ordered set (S, <). If u is an element 
of S such that t < u for all t € T, then we say that u is an upper bound of the set T. 


Definition 27.14 Let T be a non-empty subset of an ordered set (S, <). If u is an upper 
bound of the set T and, for any other upper bound v of T, u < v, then we say that u is 
the least upper bound of T. We also abbreviate the expression by writing u = lub T or u = 
lub(T). 1 


Note that if a set y is a non-empty ordinal and a €=- 8 for all a € y, then @ is an 
upper bound of y.? The least upper bound of a “non-limit (non-empty) ordinal” is 
always its maximal element. For example, 


lub(w + 2) = lub{0,1,2,3,...,w,w+1}=w+1 
since every element of this set is “€-less than or equal to” w+ 1. So 


lub(w + 2) Aw+2 


lInstead of “least upper bound of T” the word supremum of T is commonly used. 
2The symbol €= means “equal to or belongs to”. 
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But if 8 is a limit ordinal (such as w = {0,1,2,3...,}, for example), then it has no 
maximal element (immediate predecessor) and so lub(8) ¢ 2. 


The following theorem provides various ways of recognizing limit ordinals. 


Theorem 27.15 Let y be a non-zero ordinal number. The following are equivalent: 
1) The ordinal y is a limit ordinal. 
2) The ordinal y is such that y = U{a: a € y}. 


3) The ordinal y is such that lub(y) = y. 


Proof: 


(1 = 2) Suppose y is a limit ordinal. Then y is an initial segment (27.5) which has no 
maximal element. By corollary 27.13, y = U{a:ae 7}. 


(2 = 1) Suppose the ordinal y is such that y = U{a:a € y}. If y contains a maximal 
ordinal 3, then a €= for alla € y. Then U{a:a€ y} = 8 #7, a contradiction. Then 
y contains no maximal element. By definition, y is a limit ordinal. 


(1 = 3) If y is a limit ordinal, then y does not contain a maximal element. By theorem 
27.12, y = U{a:a€ y} is the €-least ordinal containing all elements of y; hence, y is the 
€-least upper bound of y. 


(3 = 1) Suppose the ordinal y is such that lub(y) = y. If y has a maximal element £, 
then a E€- 8 for all a € y. Then 8 = lub(y) € y, a contradiction. Then y cannot have a 
maximal element and so is a limit ordinal. 


Here are a few examples of least upper bounds of sets of ordinals. 


w = lub{0,1,2,3,...} = {0,1,2,3,..} =U{n:n E€ wy 
w+w = lub{0,1,2,3,...,w,w+1,w+2,...,} 
w3 = lub{0,1,2,3,...,w,w+1,w+2...,w2,w2+1,...,} 
w4 = lub{a:a€ w4} 
4 = lub{0,1,2,3,4}= lub(5) 


Concepts review: 


1. What is an “ordinal number”? 
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10. 
11. 
12. 
13. 
14. 
15. 
16. 
17. 
18. 


19. 


. What is a transitive set? 
. What does it mean to say that a transitive set is strictly €-well-ordered? 


. Given two elements x and y of a well-ordered set what does it mean to say that y is 
an immediate successor of x? 


. Give an example of an infinite linearly ordered set which contains elements with no 
immediate successor. 


. Describe a method for constructing immediate successors of ordinals. 

. When viewed as an ordinal number, how do we represent N? 

. Given an ordinal number a, which one of its elements are initial segments of a? 
. Can an ordinal number be an initial segment of itself? 

Which elements of an ordinal number a are themselves ordinal numbers? 
Which elements of an ordinal number a are proper subsets of a? 

Which subsets of an ordinal number a are elements of a? 

What can be said about two ordinals which are order isomorphic? 

How are limit ordinals different from non-limit ordinals? 

How are the ordinals w2, w3 and w4 described? 

What kind of ordinals can be represented as y = U{a: a € y}? 

What is the least upper bound (supremum) of a limit ordinal a? 

What is the least upper bound (supremum) of a non-limit ordinal a? 


What does the expression lub(a) = a say about the ordinal a? 


EXERCISES 


1. Find a well-ordered set which is order isomorphic to the ordinal number w + 3 = 
{0,1,2, ..., w,w + 1,w +2} (other than w + 3 itself). 

2. Let S be a €-well-ordered transitive set. Construct another €-well-ordered transitive 
set which contains S. 

3. What is the smallest ordinal that properly contains the ordinal number w? Find the 
smallest ordinal that properly contains all elements of the ordinal number w. 


282 


Section 27: Ordinal numbers: Definition and properties. 


4. What is the intersection of all non-zero ordinals? 


11. 
12. 
13. 


14. 
15. 
16. 
17. 
18. 
19. 


. Does the ordinal w + 2 contain an order isomorphic copy of N which is not an initial 


segment of w + 2? 


. True or False: “Every subset of an ordinal is an ordinal”. 


. Find an ordinal number which is order isomorphic to the set {1,3,5, ...,0,2,4} of 


natural numbers ordered in this particular way. 


. Is the union of two distinct ordinal numbers necessarily an ordinal number? Explain. 


. Show that there cannot exist a largest ordinal number. 
10. 


Can there be an ordinal number that is not the immediate successor of another 
ordinal number? Explain. 


Find three ordinal numbers which are equipotent with the ordinal number w + 3? 
Let U be a set of ordinals. Show that N{a@: a € U} is an ordinal. 


Let A be a set of ordinals. Show that the intersection of all ordinals in A is the least 
ordinal of the set A. 


What is the union of all ordinals in w + 1? 

Prove that {wn : n € N} is an ordinal number. 
What is the smallest limit ordinal in w3? 

What is the union of all ordinal numbers in w3 + 1? 
Show that the ordinal ww is countable. 


Construct a well-ordered set which is order isomorphic to the ordinal ww (other than 
ww itself). 
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28 / Properties of the class of ordinal numbers. 


28.1 


Summary. In this section we discuss the class, @, of all ordinal numbers. It 
will be seen to be an €-well-ordered transitive proper class of sets. Initial seg- 
ments of Ê are shown to be ordinals. Once we prove the “Principle of transfinite 
induction” we show that every well-ordered set is order isomorphic to some or- 
dinal. We then show that for any set, S, there exists an ordinal which cannot be 
embedded in S. We introduce what is known as the “Hartogs number of a set” 
and use this concept to construct a strictly increasing sequence of uncountable 
ordinals indexed with the ordinals. 


A well-ordering of the class of ordinal numbers. 
By definition, each ordinal is an €-well-ordered transitive set. Our investigation of 
ordinal numbers has revealed that: 
1) Each ordinal has an immediate successor ordinal with respect to € (27.3). 
2) 
3) Given any two distinct ordinals, one is an element of the other (27.9). 
4) 


Each ordinal is both an element and subset of any ordinal that contains it (27.7). 


Some ordinals have no immediate predecessor. Examples are 


ii ADO ot 
POS HOA, To nto 25 <5 
w3 = {0,1,2,....wt+tl,w+2,...,w2,w2+1,w2+2,...,} 


These are called limit ordinals. Limit ordinals are seen to be those ordinals which 
do not contain a maximal ordinal. Equivalently, they are those ordinals a, such 
that a = lub(a) (27.15). 


Such properties are not entirely new to us since the set, N, and its elements are known 
to satisfy these very same properties. The class of ordinals appears to be a generaliza- 
tion of the natural numbers reaching well-ordered sets which are not order isomorphic 
to N or any of its elements. We will now investigate the class which contains all ordi- 
nals. 


Notation. The class of all ordinal numbers will be denoted by, @. 
In the next few pages, we will show that the class, @, itself satisfies most of the prop- 


erties possessed by its elements. We will begin by showing that @ is an €-well-ordered 
class. 


Theorem 28.1 The class, @, of ordinal numbers is a strict €-linearly ordered class. 
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Proof: 


What we are given: @ is the class of all ordinals. 
What we are required to show: That “e€” is a strict linear order relation on @. 


— Since a ¢ a for all a € Ê, “€” is irreflexive and asymmetric. We verify transitivity of 
the order relation E: Ifa € Gand GE y, then a C 8 and B C y; so a C 4; this implies 
a E€ y. So € isa strict order relation on @. 


— That every pair of ordinals are €-comparable has been shown in theorem 27.8. 


We conclude that the class, @, of all ordinal numbers is €-linearly ordered. 


Theorem 28.2 The class, @, of all ordinal numbers is €-well-ordered. 


Proof: 


What we are given: That S is a non-empty subset of ordinal numbers in @. 
What we are required to show: That S contains an ordinal 8 such that 8 €- a for all 
ae S. 


Suppose y € S. If y € a for all a € S, then y is the least ordinal of S and we are done. 
Suppose there is some ordinal a € S such that a € y. Then yMS is a non-empty subset 
of the well-ordered set y. This means y N S' contains a least element 8. We claim that 
b is the €-least element of S. Suppose ¢ is an ordinal in S such that @ € 8. Since 8 is 
an element of y and y is transitive, o € y. Then ¢ is an element of S Ny which is €-less 
than the least element, 8, of yN S. This is a contradiction. So @ is the least element of 
S. This shows that @ is €-well-ordered. 


We know that each ordinal, a, is an initial segment of any ordinal that contains it and 
that the initial segment of any ordinal is an ordinal. The following theorem confirms 
that initial segments of @ (with respect to €) are precisely the ordinal numbers. 


Theorem 28.3 A set S is an initial segment of @ if and only if S is an ordinal number. 
Proof: 


If S is an €-initial segment of @, by theorem 26.3, there exists an ordinal y such that 
S={aEe@:acy}. But {a:a € y} isan initial segment of yU{y} and so, by theorem 
27.5, is an ordinal. 

Conversely, if S = y for some y € @, then y = {a : a € y}, an initial segment of ©. 
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28.2 The €-well-ordered class, @, is not an ordinal number. 


Recall that a set or class © satisfies the transitive property if “x € S >x CS”. We 
easily verify that @ is a transitive class: 


b E€ C and ae b > QE O (Since elements of ordinals are ordinals.) 
> BPCO 


=> @ isa transitive class. 


Since @ is a transitive €-well-ordered class of ordinals, it possesses all the essential 
properties of ordinals. But if @ is to be an ordinal, it must also be a set. The following 
theorem confirms that Ê cannot be a set. 


Theorem 28.4 The class, @, of all ordinal numbers is not a set.! 


Proof: 


Suppose the class @ of all ordinal numbers is a set. Then, since @ is transitive and 
€-well-ordered, it is an ordinal number. Then @ € @. Since @ is a transitive set, Ô C Ê. 
Since no ordinal number can be order isomorphic to a proper subset of itself (theorem 
26.5), this is a contradiction. So @ cannot be a set. 


28.3 The Principle of transfinite induction over the ordinals. 


The principle of mathematical induction over the natural numbers was seen to be an 
extremely useful tool to prove that certain properties hold true for sets whose elements 
can be indexed by the natural numbers. We remind ourselves of what it means to 
prove a statement by mathematical induction on N (or on the ordinal w as we can 
now call it). Suppose P(n) is a property which holds true depending on the value of 
the natural number (the finite ordinal) n. The principle of induction on w states that 
if P(0) holds true, and “P(n) holds true” = “P(n + 1) holds true”, then P(n) holds 
true for all values of n. We will show that this principle generalizes to mathematical 
induction over the ordinal numbers. We must, however, keep in mind that there is 
an important difference between @ and w. Some of the elements of an ordinal a may 
be limit ordinals. A limit ordinal G has no immediate predecessor and so applying 
the induction algorithm “P(a) = P(at)” cannot be used to prove that P() holds 
true. Mathematical induction generalizes to ordinals provided we can verify that P(8) 
holds true for limit ordinals, 3, as well as for non-limit ordinals. 


1Stating that the class of all ordinals is a set leads to what is referred to as the Burali-Forti paradox, the 
contradiction illustrated in this proof. It shows again that some classes are too large to be called “sets”. 


286 Section 28: Properties of the class of ordinal numbers. 


The principle of mathematical induction can be used to prove statements about 
classes or sets whose elements can be indexed by ordinals. For example, if y € @ 
and S = {£a : a E€ y}, then, since y is a set and S is a one-to-one image of y, by the 
Axiom of replacement, S is a also a set. Furthermore, the ordinal y induces a well- 
ordering on the set S. The following theorem shows how this is done. It is followed 
by a second version of mathematical induction. 


Theorem 28.5 Principle of transfinite induction. Let {£a : a E€ @} be a class whose 
elements are indexed by the ordinals. Let P denote a particular element property. Suppose 
P(a) means “the element xq satisfies the property P”. Suppose that for any 8 € @, 


“P(qa) is true V a € 8” implies “P() is true” 


Then P(qa) holds true for all ordinals a € @. 
Proof: 


We are given that for every ordinal 3, “P(q) is true for all a € 8 implies P(() is true”. 
Suppose there exist some ordinal y such that P(y) is false. 

We claim this will lead to a contradiction: By our supposition, the class A = {a : 
P(q) is false} is non-empty. Since @ is €-well-ordered, A must have a least element, 
say A. That is, A is the least ordinal such that P(A) is false. Since this is the least 
element of A, P(a) holds true for all a € A. By hypothesis, P(A) must be true. We 
obtain a contradiction, as claimed. 

Then the set, A, must be empty. So P(q) holds true for all ordinals a. 


Corollary 28.6 Transfinite induction: A second version. Let {£a : a € Ê} be a class 
whose elements are indexed by the ordinals. Let P denote a particular element property. 
Suppose P(a) means “the element £a satisfies the property P”. Suppose that: 


1) P(0) holds true, 
2) P(a) holds true implies P(a + 1) holds true, 
3) If 3 is a limit ordinal, “P(q) is true for all a € 8 implies P(8) is true”. 


Then P(qa) holds true for all ordinals a. 


Proof: 


We are given that P is a property for which conditions one, two and three hold true. 
We are required to show that P(@) holds true for all ordinals £. 
Let 8 be an ordinal. 


— If 8 =0, then by condition 1), P(@) holds true. 
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— Suppose 3 has an immediate predecessor, say At = 3, such that P(A) holds true. 
By condition 2) P(8) = P(A*) holds true. 


— Suppose is a limit ordinal such that “P(qa) is true for all a € 8”. Then by 
condition 3) P(8) holds true. 


So P(Z) holds true for all ordinals 8. 


28.4 Indexing any well-ordered set with an ordinal. 


In theorem 26.7, it was shown that given any two distinct non-order-isomorphic well- 
ordered sets, one is order isomorphic to an initial segment of the other. Since ordinals 
are, by definition, well-ordered, given any well-ordered set S and ordinal number a, 
precisely one of the following two statements must hold true: 


1) The set S is order isomorphic to some ordinal 8 € a. 
2) The ordinal a is order isomorphic to an initial segment of S. 


If the set S is order isomorphic to some ordinal 8, then this is equivalent to saying 
that “the elements of S can be indexed by the elements of the ordinal 8”. On the 
other hand, if the ordinal œ is order isomorphic to an initial segment of S, clearly the 
elements of a cannot be used to index the elements of S since a is not “big enough” 
to be used as an indexing set for S. It is not yet clear whether, given any well-ordered 
set S, there exists some ordinal which can be used to index the elements of S. The 
following important theorem guarantees that there are sufficiently many ordinals in 
Ê so that every single well-ordered set © is order isomorphic to some ordinal. That 
is, any well-ordered set S can be indexed by the elements of some ordinal number a. 


Theorem 28.7 Let S be a <-well-ordered set. Then S is order isomorphic to some ordinal 
number a € Ê. Furthermore, the order isomorphism mapping S onto a is unique. 


Proof: 


What we are given: The set S' is a <-well-ordered set. 
What we are required to show: There exists a unique ordinal œ which is order isomorphic 
to S. The required order isomorphism, f : S — a, is unique. 


For each element k € S, let Sk = {x E€ S : x < k} denote an initial segment of S. Let 


D = {u € S: Su ~wo Qu for some ordinal number a,} 
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Let 
f ={(0g,0)} U {(u, ay): ue D} CDx G! 


Now D is non-empty since 0g € D (where Og is the <-least element of S). 

We claim that f is a function with domain D: Suppose (u, a) and (u, 3) both belong to 
f. Then Su ~wo a and Su ~wo B. So a ~wo b. By lemma 27.8, œ = 8. Since the 
domain of f is a set, there is a set in the class @ which contains the image of the set D 
under f (by the Axiom of replacement). So the relation 


f: D — © defined as f(u) = ay 


is a function with domain D, as claimed. 


The function f is strictly increasing and so is an order isomorphism: If u,v € D such 
that u < v, then Su C Suv. Now, Qu ~wo Su C Sy ~wo Gy. Given that a, and ay 
are ordinals, a, € Qy. Then, u < v implies f(u) = au € ay = f(v). So f is strictly 
increasing. We conclude that f : D — f[D] C @ is an order isomorphism. 


We claim that D is a subset of S which satisfies the initial segment property: If u € D, 
then f(u) = a, for some a, E€ Ê. That is, there exists an order isomorphism g : Su > Qu 
mapping Su onto Qu. If x < u, then Sy C Su. The function g|s, is an order isomor- 
phism mapping S,, onto an initial segment (equivalently an ordinal), say a, in a,. Then 
f(x) = a,. We have shown that Vu € D, [x < u € D] => [x € D]. Hence, D satisfies the 
initial segment property, as claimed. 


We now claim that f|D] is an ordinal number: It suffices to show that f[D] is an initial 
segment of @ and invoke theorem 28.3 (which states that any initial segment of @ is 
an ordinal). Since f|D] has been shown to be a set, then f[|D] cannot be equal to the 
proper class Ê. Let a, € f[D]. Then there exists an element v € D such that S, is order 
isomorphic to ay. Let 8 € ay. Then B C ay. Since S, is order isomorphic to a,, then 
B is order isomorphic to an initial segment Su C Su. Then f(u) = 8. So B € f|D]. We 
have shown that if 6 € a, € f[D], then 8 € f[D], and so, f[D] is an initial segment of 
©. Then, by theorem 28.3, f[D] is an ordinal number, as claimed. 


Finally, we claim that D = S: Suppose the domain D of f is not all of S. We have shown 
that f[D] is an ordinal number. Say f[D] = y € @. Since D # S, there exists some 
q E S such that S, = D (having shown that D satisfies the initial segment property). 
Then f : D— f[D] = f[Sq] is an order isomorphism mapping Sq onto the ordinal y. So, 
by definition of f, (q,y) € f. This means that q € D. But D=S,={x eS: a <q} 


1To better understand the set f we describe the first few elements. Suppose 1s, 2s, 3s represent the first 
few elements of the well-ordered set S. 
Sig ={uEeS:u<l1s}={0s}~wo {f0}=l=aipg => (is, l)ef 
S25 = {u E S: u < 2s} = {0s,ls} ~wo {0,1} =2=a2, => (25,2) Ef 
S35 = {u E S: u < 3s} = {0s,1s,2s} ~wo {0,1,2} =3=a03, => (33,3)E f 
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implies q ¢ D. This is a contradiction. The source of the contradiction is the supposition 
that DAS. Then D = S, as claimed. 

Then f maps S order isomorphically onto the ordinal number y. By theorem 26.5 part 
e), this order isomorphism is unique, as required. 


This means that the elements of any well-ordered set S can be indexed by the elements 
of some ordinal. That is, if (S,<) is a well-ordered set which is order isomorphic to 
some ordinal 3, then S can be expressed as the indexed set S = {sa : a E€ 6}. This 
makes the set S susceptible to proofs by mathematical induction over the ordinal 9, 
an extremely useful tool for proving various mathematical statements. 


At this point, it will be useful to introduce some vocabulary that will allow us to state 
which ordinal is order isomorphic to a given well-ordered set S. 


Definition 28.8 Let S be a <-well-ordered set. If œ is the unique ordinal which is order 
isomorphic to S, then we will say that S is of order type a, or is of ordinality a. If S is of 
ordinality œ, we will write 

ord ¢ =Q 


The set N, ordered in the usual way, can then be said to have ordinality, 
dN — w 
On the other hand the set N* ordered as 
{0,2,4,6,...,1,3,5,...} 


has ordinality 
rIN* = w +w = w2 


The lexicographically ordered countably infinite set S = {1, 2,3} x N has ordinality, 


"IS = w+w +w = w3 


Viewing ordinals as ~wo-equivalence class representatives. Let W = {9 E€ Sf: 
S is well-ordered} denote the class of all well-ordered sets. We easily see that ~wo 
is reflexive, symmetric and transitive on W, and so is an equivalence relation on this 
class of sets. Let W* = {[S]wo : S E€ W} denote the class of all equivalence classes 
induced by ~wo. We have shown that every well-ordered class is order isomorphic to 
some ordinal. Then if [T]wo E€ W*, the equivalence class, [T]wo, contains precisely one 
ordinal which is order isomorphic to every well-ordered set it contains. This means 
that we can adopt the ordinals in Ê as ~wo-equivalence class representatives of the 
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elements in #*. For example, if w3 € [S]wo E W*, it means that [S]wo contains pre- 
cisely all well-ordered sets of ordinality w3. If °¢T’ = wo, then [T]wo contains precisely 
all well-ordered sets which are order isomorphic to N (when N is ordered in the usual 
way). In this case, [T]wo contains Neven = {0,2,4,6,...,} but not the well ordered 
set M = {0,2,4,6,...,1,3,5,7,...} where “M =w+w=w2#w. 


Proving the existence of uncountable ordinals with Hartogs’ lemma. 


Some readers may have noticed that we have not yet exhibited an uncountable ordinal 
(or even a single uncountable well-ordered sett). A potential candidate for the least 
uncountable ordinal may be the set: 


wı = {a € @:a isa countable ordinal } 


This is a well-defined class of ordinals whose elements are precisely all countable or- 
dinals. We show that it satisfies two fundamental characteristics: 


— The class wı satisfies the “initial segment” property : Suppose a € wı and 8 € a. 
Since a is a countable ordinal, then the ordinal 8 is countable and so 8 € w1. Then 
w satisfies the “initial segment” property. 

— The class wı is uncountable: Suppose w1 is countable. Since it is the one-to-one 
image of the set N, w1 is a set. Then w; is an initial segment of ordinals which is 
not equal to @ (since @ is not a set). Then, by theorem 28.3, wı is an ordinal. 
It follows that w; € w1. Since no ordinal can be an element of itself we have a 
contradiction. The class w; cannot be a countable set and so must be uncountable. 


Can we conclude from this that w1 is an ordinal? What if w; = Ê? If so, then w1 isa 
proper class. To show that w, 4 @ it will suffice to show that an uncountable ordinal 
b exists. We must then first show that at least one uncountable ordinal exists. 


The class of all well-ordered subsets of a well-ordered set. Suppose we are given a 
set (S, <s) on which we have defined a well-ordering relation <s. To say that R is a 
well-ordering relation <s on S is to say that for a,b € S, 


a <s b < (a,b) E R 


So we see in fact that R = {(z,y) € Sx S: £ <s y} is a subset of S x S and 
so the relation R is an element of the power set A(S x S). Also note that the 
set S is an element of the power set Z (S). So the well-ordered set (S, <s) can be 
precisely expressed as an ordered pair belonging to A(S) x A(S x S). For example, 
if S = {1, 2,3} and “<” takes on its usual well-ordering “strictly less than”, then 


(S, <) = ({1, 2, 3}, {0 2), (1,3), (2,3)}) 


Tf an uncountable well-ordered set is constructed, then, by theorem 28.7, it must be order isomorphic to 
some (uncountable) ordinal number. 
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represents a particular element of A(S) x P(S x S). Equivalently, it expresses a 
particular well-ordering of the set S. On the other hand, if the ordering “<s” of 
S = {1, 2,3} is such that 3 <s 2 <s 1, then 


(S, <s) = ({1, 2, 3}, {(3, 2), (3, 1), (2, 1)}) 


is a different element of P(S) x P(S x S) testifying to the fact that (S, <s) and 
(S,<) are distinct well-orderings of the set S. 


For any subset, T of S, on which we have defined a well-ordering relation <r, (T, <r) 
is uniquely represented by a specific element of P(S) x P(S x S). For example, 
referring to the same set S = {1, 2,3}, letting T = {1, 2}, 


(Ee) ({1,2},{(1,2)}) E€ P(S) x P(S x S) 


Suppose we let 


As = {(T, <r) E P(S) x P(S x S): <r well-orders T} 


The class s is the class of all well-ordered subsets (T, <r) of S. Since @g is a sub- 
class of the set Z (S) x A(S x S), it is also a sett. 


With these facts in mind, we are now ready to state and prove a very clever result 
known as Hartogs’ lemma.? 


Lemma 28.9 Hartogs’ lemma. Let S be any set. Then there exists an ordinal œ which is 
not equipotent with S or any of its subsets. 
Proof: 


What we are given: That S' is a set. 

What we are required to prove: That there exists an ordinal œ that cannot be mapped 
one-to-one into the set S. 

Let %s = {(T, <r) E P(S) x P(S x S): <r well-orders T} denote the set of all 
well-ordered subsets of S. By theorem 28.7, every well-ordered set has an ordinality 
and so there is a well-defined function f : # — @ defined as f((T,<r)) = (T, <r) 
mapping each well-ordered set in g to some ordinal in @. Then f|.%] is a subclass of 
©. The Axiom of replacement guarantees that f[.%] is a set (not a proper class) and so 
flas] # @. Since f[<Z] is not all of @, there exists an ordinal 8 € C — f[.@%]. 

We claim that the ordinal B cannot be equipotent to any subset of S: For suppose, 
h : B — B is one-to-one and onto a subset B C S. Then, by theorem 26.1, B inherits 
a well-ordering <p from ( (that is, h(u) <s h(v) in B if and only if u € v in 8.) Then 
(B,<z) E€ &. This contradicts the fact that 8 is not in the image s under f. So 8 
cannot be equipotent with any subset of S, as required. 


‘Since S is a set, both P(S) and P(S x S) are sets; hence, P(S) x P(S x S) is a set. 
2Notice how the Axiom of replacement plays a fundamental role in the proof of Hartogs’ lemma, 28.9. 
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Theorem 28.10 There exists an uncountable ordinal. 
Proof: 


Let the set S in the lemma be the set N of all natural numbers. By Hartogs’ lemma, 
there is an infinite ordinal @ which cannot be mapped one-to-one onto any subset of N. 
Since the ordinal ( is not equipotent to any subset of N, then it cannot be countable, So 
b is an uncountable ordinal. An uncountable ordinal exists. 


Corollary 28.11 The class w; = {a € @ : ais a countable ordinal } is the €-least un- 
countable ordinal. 


Proof: 


It was shown that w is a subclass of Ê which satisfies the initial segment property. Har- 
togs’ lemma states that there exists an ordinal number y which is uncountable. Since 
a € y for any countable ordinal a, it follows that w C y. Since y is a set, then wı must 
also be a set; hence, w; cannot be equal to the class @ of all ordinals. Proper subsets of 
CG which satisfy the initial segment property were shown to be ordinals. Hence, w1 is an 
ordinal. Since any ordinal æ € w, is countable, then w; must be the €-least uncountable 
ordinal. 


We can now lay this problem to rest. Uncountable ordinals exist in the ZF'C-universe. 


A few words of caution. One may be tempted to say that since w, is the least un- 
countable ordinal number, then the sets wı and R are equipotent. But nowhere have 
we shown that a one-to-one function between R and w4 exists. If such a one-to-one 
function between R and w was shown to exist, then the ordinal wı would induce a 
well-ordering on R, surely an unexpected result.” 

We pause to review a sampling of (countable and uncountable) ordinal numbers we 
have seen up to now. 


Ww 
O51, 2p WW Do ay WN Go WO pee WI WI BE Lp FW peeing 


The ordinals listed here are strictly €-ordered, in the sense that any ordinal is an 
element of any ordinal which appears on its right. We can also say that the ordinals 
are strictly <wo-ordered, in the sense that any ordinal is order isomorphic to an initial 


?One would never expect that R is well-orderable, but it will turn out to be the case, provided one believes 
in the Axiom of choice. That R is well-orderable does not follow from the Schréder-Bernstein theorem as 
some might suspect. 
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28.6 


segment of any ordinal which appears on its right. But how are the elements of this 
sampling of ordinals ordered with respect to the equipotence relation <,?? In this 
case we obtain 





Oe Loe 2 3B Ce ts Ce NeW 


wW 


WreWtl wett Ne WN Ne't Ne W Neti Ne WY 


w” w” 
W 1e >e Wy Ye W1 tlre we WwW HUW we: Ne WI tw aA 


In the first line, each ordinal is properly embedded into any ordinal which appears on 
its right. In the second line, all ordinals are equipotent to N. In the third line, 
all ordinals to the right of wı are equipotent to wı (since each of these can be 
viewed as the union of wı with a countable set). This sampling of ordinals makes 
us wonder: Are there any uncountable ordinals @ such that w; ~e 8B? Hartogs’ 
lemma guarantees that such ordinals exist. Consider the class H = {a € Ø : 
a is not equipotent to any subset of the ordinal w1}. By Hartogs’ lemma, the set H 
is non-empty. Since @ is a well-ordered class, the non-empty class, H, must have a 
least element, which we denote by wə. Since @ is linearly ordered, wə is the unique 
least ordinal which is not equipotent to wı or any of its subsets. By successively 
invoking Hartogs’ lemma, we can show, in a similar way, that there exists unique 





uncountable ordinals w2, w3,...Wn,..., such that 
0 >e Le 2 >e 3 Sett SeN Se W Se W Se WZ Se W3 Fe 11+ Se Wn: 
The —,-ordered chain of ordinals {0,1,2,3,...,n,...,w, WI, We, .--, Wn, ---,} is 


seen to be a countably infinite set of ordinals. Furthermore, the ordinals listed are 
“c_+.-complete” in the sense that for any pair, a, 3, of successive ordinals in this list, 
there does not exist an ordinal y such that æa —e y ~~ P. But this list of ordinals 
is not “<yo-complete” in the sense that for any n, wn <wo Wn +1 <wo wn + 2 <wo 
+++ <wo Wn+1: That is, if we order the ordinals with “<wo”, then there are plenty of 
ordinals between wn and wn+1- 


This makes us wonder whether there exists an even larger ordinal, 3, such that 
Wn “>e GB, for all n € N. We will prove that there is. 


The Hartogs number of a set. 


To help answer the question above, we introduce the notion of a Hartogs number of a 
set. 


3Recall that A >e B if and only if A is equipotent to a proper subset and A %e B. 
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Definition 28.12 Let S be any set. Let 
Us ={a€@: a not equipotent to any subset of S} 


By Hartogs’ lemma the class, Ug, is non-empty. Since @ is €-well-ordered, Us contains a 
unique least ordinal, we will denote as, h(S). We will call the ordinal, h(S), the Hartogs 
number of the set S. Then h can be viewed as a class function, h : S — @, which asso- 
ciates to each set S in the class of all sets Z, a unique ordinal number, a, in the class of 
all ordinals @.1 


By Hartogs’ lemma (28.9), every set S in Z is assigned a unique Hartogs’ num- 
ber, h(S). For example, the Hartogs numbers, h(10), of the ordinal 10 is the or- 
dinal 11, while the Hartogs number, h(w + 7), of w + 7 is the ordinal w, = {a : 
a is a countable ordinal}.? We will use the above definition of Hartogs numbers to 
show how we can recursively construct an endless, strictly increasing, —>e-chain of 
uncountable ordinals. 


In the proof of the theorem below, we will use a function constructing procedure 
called “transfinite recursion”. We have encountered recursively defined functions be- 
fore when defining addition (15.2) and multiplication (15.4) on N. Using transfinite 
recursion to define a class function over the ordinal numbers is analogous to recursively 
defining a function on N (as shown in theorem 18.8). It is in fact a generalization 
of this process. The theorem which declares that recursively defined functions are 
well-defined functions is called the Transfinite recursion theorem. 


The procedure for the construction of a recursively defined function g : Ó — W on @ 
is as follows: 


Suppose W is a well-ordered set. 


— The first step is to assign a value a € W to g(0). 
— Then assuming that g(a) is defined for an ordinal a we define g(a*) in terms of 


g(a) according to a well-defined function f : W — W. That is, g(at) = f(g(a)). 
This is where the similarity with recursively defined definitions on N ends. 


— If 2 is a limit ordinal, we define g(3) = lub{g(a): a € P}. 


We will illustrate how the transfinite recursively defined functions are constructed in 
the proof of the following theorem. We will differ the proof of the Transfinite recursion 
theorem to the end of this section. 


See page 82 for the definition of “class function”. 
3The Hartogs’ number, h(w +7), of w +7 is not w+ 8 since w + 8 ~e w +7 ~e w. 
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Theorem 28.13 There exists a strictly <,-increasing class {wa : a E€ @} of (pairwise 
non-equipotent) infinite ordinals, all of which are uncountable except for wo = w. 


Proof: 


Let h denote the class function which associates to each set S the unique least ordinal 
number which cannot be embedded in S. The ordinal, h(S), is called the Hartogs number 
of S. By transfinite recursion, we define the class function g : Ê — @ as follows: 


g0) = w = CIN Hw 
glat) = wat = h(wa), forall a 
gq) = wa = lub{w}:y € a}, for all limit ordinals a 


Note that the function, h : @ — @, maps the proper class of all ordinals Ê into @. The 
Transfinite recursion theorem guarantees that the sequence 


{gla):a E€ O} ={wa: ae O} 


indexed by the ordinals is well-defined. 

We claim that the sequence {wa}aco is (strictly) >e-increasing: 

The proof of the claim is by transfinite induction. Let P(a) denote the statement “8 € a 

implies wg œe Wa”. 
Base case: The statement P(1) holds true since wọ = w is properly embedded in all 
uncountable sets (by 18.9) and so wo >e w1. 
Inductive hypothesis for the case where a is a non-limit ordinal: Suppose P(a) holds 
true for the non-limit ordinal a. That is, “8 E€ a > wg =e Wa”. By definition of the 
Hartogs number of the ordinal h(wa) = wa+t = Wat1 Fe B, for any subset B C wa. 
Then wa+1 Z= Wa and so wa E€ Wa+. This means that wa is order isomorphic to some 
initial segment of w,+ (by theorem 26.7). Since wa+ Że Wa, then wa Ge Wat. If BE at 
and 3 £ a, then wg ~e Wa Ce Wat; hence, wg Ge Wat+. So P(at) holds true. 





Inductive hypothesis for the case where a is a limit ordinal: Suppose P(y) holds true 
for all ordinals y € a where a is a limit ordinal. We claim that P(a) holds true. Let 
B € a. Weare required to show that wg e Wa. Since wa = lub{w, : y E a} and b € a, 
Wg E= Wa. Now wg is not equal to wa for if it was, wa = wg E wg+ €E {wy :7 € a} 
contradicting the fact that wa is an upper bound of {wy : y E€ a}. Then wg E wa. Since 
Wg =e Wet E Wa, then wg =e Wa. So P(a) holds true. 

By transfinite mathematical induction, for any ordinal a, 6 E€ a > wg Se Wa. 

Hence, the class, {wa : a E€ @}, constructed above is a class of (strictly) —>e-increasing 

ordinals, as claimed. 


Notation: From here on, the least infinite ordinal, “¢N, previously represented by w, 
will be represented as wo. 


We immediately establish a few facts about the class {wa :a€ Ø}. 
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Proposition 28.14 Let {wa : a E€ Ê} be the class of ordinals as defined in the previous 
theorem. 


a) Every element of {wa : a € G} is a limit ordinal. 


b) For every ordinal a € @, either a € wa or @ = wa. (Equivalently, Va € Ô, wa Z a.) 


Proof: 


a) Let wy E {wa : a E Ê}. We consider two cases: 1) y is a successor ordinal, and 2) y is a 


limit ordinal. 
Case 1: Suppose y = at, for some a. We claim that w, must be a limit ordinal. Suppose 
not. If wa+ is not a limit ordinal, then wa+ = GF = GU {8} for some ordinal 8. Since 
both wa+ and are infinite sets, by theorem 20.6, wa+ ~e BU {8} ~e B, S0 Wat ~e B. 
Furthermore, w,+ = 8 U {8} implies that 6 € wa+. That is, 8 is €-less than w,+. By 
definition, wa+ is the least ordinal such that wa+ %e Wa. Since B ~e Wa+, then 8 is 
an ordinal strictly less than w,+ such that O Ze Wa, a contradiction. The source of 
the contradiction is our supposition that w+ is not a limit ordinal. So any element in 
{wa : a E€ Ø} of the form w,+ must be a limit ordinal. 
Case 2: Suppose y is a limit ordinal. Then, by definition, wy = lub{wa : a € y}. We 
claim that w, must be a limit ordinal. Suppose not. That is, suppose wy = BU{G} = Br, 
for some ordinal 3. Then 8 € w, and p ~e wy. 
Claim: That 3 E= wy for some Y € y. 
Suppose not. That is, suppose wa € $ for all a € y. Then w, = lub{wa : a E€ y} E= 
B E w. Then £ is an upper bound of {wa : a E y} which is strictly less than w,, a 
contradiction. So 3 E= wy for some w € y, as claimed 


Since y is a limit ordinal, Y+ € y; hence Wyt E {wa : a E€ y}. Then 
Wy ~e B = wy E wy+ E {wa a E 7} 


contradicting the fact that w, is an upper bound of {wa : a € y}. The source of this 
contradiction is our assumption that wą is not a limit ordinal. We can only conclude 
that for case 2, wą is a limit ordinal. 


We are required to prove that: Va € Ô, @ E= wg. 

The proof is by transfinite induction. Let P(a) denote the statement “ a E- wa”. 

Base case: The 0-ordinal € wo since 0 belongs to all ordinals except the ordinal 0. So 
P(0) holds true. 

First inductive hypothesis: Suppose P(a) holds true for some a. That is, suppose 
@ E= We. We are required to show that at E€- wa+. Since a E= Wa, then either 
Q E Wa OF Q = Wa- 


Part VIII: Ordinal numbers. 297 


Case 1: Suppose a € wa. Then, since wa is a limit ordinal, at € wa E€ wa+. So P(aT) 
holds true. 
Case 2: Suppose œ = wa. Then again, since w,+ is a limit ordinal and wa E Wa+, 
at =wt € wa+. So P(at) holds true. 
Second inductive hypothesis: Suppose y is a limit ordinal, lub{wa : a E€ y} = wy and 
P(q) holds true for all a € y. That is, “a E= wa” for all a € y, 
We are required to show: That y E= wy. 
If  € U{wa: a E y}, then for some a € 7, Y E Wa E Wat+ E wy; hence, 





Ufwe ne E Y} Ewy 


Since y is a limit ordinal, y = U{a: a € y} (by theorem 27.15). 
Given that “a E= wa” for all a € y, then a C wa for all a € y and so, 


y=Ufa: ae 7} C Ufwa: a€ y} Cw, 


We conclude that y E= wy. 
By transfinite induction, œ €= We for all ordinals a. 


A chain of ordinals under two distinct relations. The class {wa : a E€ Ê} can be 
viewed as a chain of infinite ordinals which is strictly ordered by “e”: 








Wo E Wy E w2 E+ E Way E Wag tt E 07 E Way? E ++ © Wuguy E °° E Way Ee 


The class of ordinals {wa : a € Ê} is also inductively constructed in a way that wa 
is not equipotent to any of its predecessors. Then it is also strictly ordered by the 
proper embedding relation “>e”: 








Wo Fe WI Ce tt Ve Wug Pe Wag tl] Ve °° * Ve Ww? We + °* Ve Wu, Vest’ 


Amongst these, only wo is countable. This takes us considerably further down into 
the realm of uncountable sets. We already knew that the set {A”"(N) : n € N} was 
a countably infinite set of pairwise non-equipotent sets. What is new here is that the 
class, {wa : a E€ Ê}, is composed of as many pairwise non-equipotent well-ordered 
uncountable sets as there are ordinals! 


28.7 A particular property of the first uncountable ordinal w1. 


Consider the two sets R and wI = {0,1,2,3,...,w1}. These two sets have some 
properties in common. For example, they are both linearly ordered and are both 
uncountable. However, because of their structure, they can also be seen as being 
radically different in nature. For example, R has infinitely many uncountable initial 
segments (subsets of the form (—oo,a)), while wf has only one uncountable initial 
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segment, namely w; = [0,w,). The real numbers R has another property which is 
not shared with wf. It is well known that, for every element a € R, {a} is the 
intersection of countably many open intervals. For example, {a} = N{(a—+,a+4): 
n = 1,2,3,...}. We will refer to a set which is the countable intersection of open 
intervals in a linearly ordered set as a G5-set. For every a € R, {a} is a G5-set. We 
cannot say that wI shares the same property. We will show that {w1} is not a Gs-set 
in wI . We will first describe those subsets of wI which are open intervals. If a and 8 


are distinct non-zero elements of wI , the open intervals in wI are the sets of the form 


[0,a) = {K:0E€_K Ea} 
(Ga) = {K:BEKEea} 
(Bwt) = {k:bEk Ew} 
Suppose {(an, w] ) : n = 1,2,3,..., } isa countable set of open intervals each of which 


contains the ordinal w;. Then a, € w1, for all n. Let y = lub{a, : n =1,2,3,..., }. 
If y € {an : n = 1,2,3,...,} then an E€- y E€ yt € w, for all n. If y € fan: n = 
1,2,3,..., } then y = U{an : n = 1,2,3,..., }. Since y is the countable union of count- 
able sets, it must itself be countable (theorem 27.12). Then, again, a, E€ y E€ Y® € w1, 
for all n. We must then conclude that yt € D{(an,w]) : n = 1,2,3,...,} and so 
{w} # M(an,wf) : n = 1,2,3,...,}. That is, {w1} is not a Gs-set. However, 
there may still be some limit ordinals larger than wı which are G5-sets. Witness, 
{wi two} = N{ (w1 +n, wi +wo +1): n=1,2,3,...,}. 


The proof of the Transfinite recursion theorem. 


We end this section by proving that recursively defined functions over the ordinals @ 
are indeed well-defined. 


Theorem 28.15 The Transfinite recursion theorem. Let W be a well-ordered class and 
f:W — W hbe a class function mapping W into W. Let u € W. Then there exists a unique 
class function g : @ — W which satisfies the following properties: 


a) 
b) 


c) 


g(0) =u 
glat) = f(g(a)), Ya € 6 
g(G) = lub{g(a): a € 8}, V limit ordinals 8 


Proof outline : 


Let # denote the class of all subclasses of © x W which satisfy the three properties 
given in the theorem statement. That is, U € # if and only if 
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a) (0,u) EU 
b) [(a, x) € U] = [(a*, f(z)) € U], Ya € 6 
c) For any limit ordinal £, 


(a, £a) E U Ya € 6] = (8, lub{za:& E B}) EU 


Now # is non-empty since Ê x W satisfies all three properties and so @ x W is an 
element of #. Let G = Nyexw {U}. Then G is the smallest element of # (with respect 
to “C” ). The objective is to prove that the class G is the uniquely defined class function 
g:@—W that we seek. 

The first step is to show that G € #. This is straightforward and so is left as an exercise. 
The second step is to show that G is a class function, while the third step is to show that 
G is unique. 

We will show that G is a class function by transfinite induction. For each ordinal y let 


Gly = {(a, £a) EG: a E= 7} 
For example, 


G|o contains at least 


(0, u) 
G|ı contains at least (0, u) and (1, f(u)) 
G|2 contains at least (0, u), (1, f(u)) and (2, f(f(u))) 
G|3 contains at least (0, u), (1, f(u)), (2, F(f(u))) and (3, F(F(F(u)))) 


Let P(a) represent the statement “G|, is a function”. 

— Inductive hypothesis: Case 1. Suppose P(a) holds true for all a € ¢* for some non- 
limit ordinal ¢*. This means that Gg is a function. 
We are required to show that P(¢*) holds true. That is, we must show that G|,+ is also 
a function. Now G|g+ = Glg U {(¢*, x) : (67,2) € G}. We know that (¢, 24) € Glo 
so (oT, f(ae)) € {(o*, x) : (t,x) € G}. To show that G|ọ+ is a function it suffices 
to show that {(¢*, x) : (¢*,x) € G} is the singleton set {(4*, f(xg))}. Suppose not. 
That is, suppose there exists in G an element (7, y) such that y 4 f(«¢). 
Claim: G — {(7, y)} € #&. If so, then this contradicts the fact that G is the smallest 
element of #. The proof of the claim is left as an exercise. 





Assuming the claim is proved, we conclude that P(¢*) holds true. 

— Inductive hypothesis: Case 2. Suppose P(y) holds true for all ordinals a € y where y is 
a limit ordinal. This means that G|, is a function for all ordinals a € y. Equivalently, 
{(a, za) : a E€ 7, (a, £a) E€ G} is a function. 

We are required to show that P(y) holds true. That is, we must show that G|; is also 
a function. Now 


Gly = { (a, £a) : & E Y, (&, a) SG EU {(7, £4) : (Y, £7) E G} 
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Let s, = lub{zq : a € y}. We know, by definition of G, that (y, s4) € {(7, £y) : 
(y, v1) E€ G}. To show that G|, is a function it suffices to show that {(7, 2) : 
(y, £4) E G} is the singleton set {(7, s)}. Suppose not. That is, suppose there exists 
(y, y) such that y £ s,. 

Claim: G — {(7, y)} € &. If so, then this contradicts the fact that G is the smallest 
element of #. The proof of the claim is left as an exercise. 


Assuming the claim is proved, we conclude that P(y) holds true. 


Then by Transfinite induction “G|a is a function” for alla € @. 


We claim that Œ must then be a class function. Suppose not. Then there exists œ such 
that {(a, x), (a,y)} C G where x # y. Then, {(a, x), (a,y)} C Gla, where Gl, is a set 
shown to be a function. Since this is a contradiction, G must then be a class function, 
as claimed. 


The proof that G is unique is left as an exercise. 


We then define the class function g in the statement as g = G. 


Concepts review: 


1. 
2. 


10. 
11. 
12. 


13. 


Which ordering relation well-orders the class, @, of all ordinals. 


If S is a subset of ordinals in @ what is one way of describing its least element? 


. What can we say about initial segments of the well-ordered class @? 
. Is @ an ordinal number? Why or why not? 
. How do we define the immediate successor of an element of an ordered set? 


. Give an example of a linearly ordered set where no element has an immediate succes- 


sor. 


. State the two versions of the principle of induction over the ordinals. 


. What does it mean to say that elements of every well-ordered set can be indexed by 


the elements of some ordinal? 


. Which ZFC axiom is invoked to prove that every well-ordered set is order isomorphic 


to a single ordinal. 

What does “ordinality of a well-ordered set” mean? 

What does Hartogs’ lemma state? 

How does the existence of an uncountable ordinal follow from Hartogs’ lemma? 


What is the least uncountable ordinal? 
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14. 


15. 


What is the Hartogs number of a set S? 


How is the concept of Hartogs number combined with the Transfinite recursion theo- 
rem to show that there exists an infinite sequence of uncountable ordinal numbers no 
two of which are equipotent? 


EXERCISES 


1. Show that a well-ordered set can only be isomorphic to a single ordinal number. 


10. 
11. 


. We have seen that wọ + wọ is a limit ordinal. Describe the smallest limit ordinal 


which is larger than wo + wo. 


. Is the non-limit ordinal wọ + 2 equipotent with the limit ordinal w2? 
. What is the smallest ordinal number which is equipotent with w3? 


. Let Z (Q) denote the set of all subsets of the set of rational numbers Q. 


a) Construct a countably infinite subset S of A(Q) which is well-ordered by the 
relation C such that (S, C) is order isomorphic to the ordinal number wọ. Prove 
that C both linearly orders and well-orders the set S. 


b) Construct a countably infinite subset T of A(Q) which is well-ordered by the 
relation C such that (T, C) is order isomorphic to the ordinal number wo + wo. 


. Theorem 26.7 states that “any two well-ordered sets S and T are either order isomor- 


phic or one is order isomorphic to an initial segment of the other”. Can we replace 
the word “sets” with the word “classes” in this statement. Justify your answer. 


. Construct a set which is not an ordinal number but whose elements can be indexed 


by the elements of w5. 


. Consider the lexicographically well-ordered set S = {1,2,...,100} x N. State the 


ordinal number which is order isomorphic to the subset 


{(1,0), (1,1), (1, 2), (1,3), ..., (2,0)} 


Which ordinal number is order isomorphic to S? 


. Let S = {a € @: |a| < |N|}. Does S have a maximal element? If so what is it? If 


not state why. 

Show that there is an ordinal number a which is not equipotent with R. 

Let S = {a € @: |a| < |u|}. Does S have a maximal element? If so what is it? If 
not state why. 
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29 / Initial ordinals: “Cardinal numbers are us!” 


29.1 


Summary. In this section we formally define “initial ordinals”. We then prove 
that the class of all initial ordinals is precisely the class f = wo U {wa :a E O} 
defined at the end of the previous section. We state and prove (by invoking the 
Axiom of choice) the Well-ordering theorem. Finally we define the “cardinal 
numbers” as being initial ordinals. 


Initial ordinals. 


Hartogs’ lemma is one of the first statements which links ordinals to sets S$ in a 
way that does not depend on the structure of S. It states that “for every set S, 
Us = {a € Ê : a Pe Sanda, S} # Ø”. Since the class, Us, of ordinals is non- 
empty and @ is €-well-ordered, then it has an €- least element. We called this €-least 
element of Us the “Hartogs number”, h(S), of the set S. Since ordinals are sets, then 
every ordinal, a, has a Hartogs number, h(a). The Hartogs number of the ordinal, 
a, can be viewed as being the unique ordinal, h(a), satisfying the following properties: 


1. a E€ h(a) 
2. h(a) Że y, for every ordinal y in h(a) 


For example, the Hartogs number of the ordinal, wo + wo, is 
h(wo + wo) = w1 = {a € Ê : a is countable } 


since w4 is not equipotent to wọ +woọo nor to any of its elements, and, w1 %e B, for any 
BE {a €@:«a is countable } = w. The Hartogs number of wo +wọo cannot be w1 +3 
since wy +3 ~e w1 + 1 € w1 + 3, contradicting the fact the Hartogs number, h(a), of 
an ordinal a, cannot be equipotent to any element of h(a). Also, the Hartogs number 
of any finite ordinal, n, is h(n) = n+ 1 since n + 1 is not equipotent to n or any of its 
elements, and n + 1 %e m for anymen+l. 


By definition, the Hartogs number, h(a), of an ordinal, a, is never equipotent to any 
of its elements 8 € h(a). An ordinal which is not equipotent with any of its elements 
is given a particular name. 


Definition 29.1 We say that an ordinal, 8, is an initial ordinal if it is the least ordinal 
which is equipotent with itself. That is, 6 is an initial ordinal if œa E€ B >a fe B. 
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We already know of many initial ordinals. Trivially, every finite ordinal, n, is the 
least ordinal equipotent to n since the only ordinal which is equipotent to the natural 
number n is n. Also, no ordinal œa € wọ is equipotent to wọ, so wọ is seen to be the 
least countably infinite initial ordinal. Next in line is the ordinal, w1, shown to be 
the least ordinal not equipotent to any countable ordinal. It is then, by definition, 
the second infinite initial ordinal. Of course, wo + wo is not an initial ordinal since 
wo +2 E€ wo + wo where wo + 2 ~e wo ~e wo + wo. 


We will investigate the elements of the class, {wa : a € G}, of recursively constructed 
ordinals in theorem 28.13. We recall how these ordinals were defined: For a successor 
ordinal a = ¢ +1, wa as defined as being h(¢), while for a limit ordinal y, wy was 
defined as, wy = lub{we : a € y}. The above examples suggest that ordinals such as 
Wa are in fact initial ordinals. We introduce the following notation. 


I = {0,1,2,3,...,} U fog: ae O} 


We will show that the class, X, is precisely the class of all initial ordinals. 


Lemma 29.2 Every initial ordinal is an element of the class of ordinals, J. 


Proof: 


Given: w is an initial ordinal and Y = {0,1,2,3,...,}U{wa:ae O} 

Required to show: Y E€ I. 

If the initial ordinal, w, is a finite ordinal, then Y € J, and we are done. We then 
suppose that w is an infinite initial ordinal. 

By part b) of theorem 28.14, for any ordinal Y, Y E= wy. If Y = wy, then Y E€ Y and 
we are done. Suppose Y € wy. 

We claim: That Y = wg € wy, for some ordinal 8 (and so Y € J). 


The proof of the claim is by transfinite induction. 
Let P(a) denote the statement “If ~ is an infinite initial ordinal in wa, then Y = wg € 
Wa, for some ordinal 8”. 
Base case: If w is an infinite initial ordinal which belongs to w1, then Y = wo € w1. So 
P(1) holds true. 
First inductive hypothesis: Suppose P(a) holds true for some ordinal a. That is, for 
any infinite initial ordinal Y E€ wa, Y = wg E€ Wa, for some ordinal 3”. We are required 
to prove that P(at) holds true. Suppose Y% E€ wa+ = h(wa). Since both wa and w 
belong to wa+ either Y E= We or Wa E Y. 

— Case 1: If Y = wa, then we are done. 

— Case 2: If Y € wa, then by the inductive hypothesis, there exists some ordinal 8 

such that Y = wg € wa. 
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— Case 3: If wa E w, then, since w is an initial ordinal, wa %e Ww. Then wy+ = 
h(wa) E= vw. But this contradicts our hypothesis Y E€ wy+. So wa Z wv. This 
means, Y E= Wa. 

If Y = wa, then we are done. Otherwise, Y E€ wa implies Y = wg for some 3, by our 
inductive hypothesis. In both cases y € J. So P(at) holds true. 

Second inductive hypothesis: Suppose y is a limit ordinal and P(a) holds true, for all 

a € y. We are required to show that P(y) holds true. Let ~ be an infinite initial 
ordinal such that Y% E€ wy = lub{wa : a E€ y}. Then Y E wa, for some a € y. By the 
inductive hypothesis, there exists an ordinal @ such that Yy = wg € wa. So Y E F. 
Hence, P(y) holds true. 

This proves the claim that (Yy € wy) > (Y = wg E wy) for some ordinal 8. Hence, 
w E€ J, as required. 


Theorem 29.3 The class, 4% = {0,1,2,3,...,}U {wa :a€ Ê}, is precisely the class of all 
initial ordinals. 


Proof: 


We have shown in the lemma that {a € @: a isan initial ordinal } C J. To show 
that {a € @ : ais an initial ordinal } = J, it suffices to show that % C {aE CO: 

& is an initial ordinal }. Since we have already shown that the finite ordinals are initial 

ordinals it suffices to show that {wa : a € C} C {a€ Ê: ais an initial ordinal }. We 
prove the statement by transfinite induction. Let P(a) denote the statement “wa is an 
initial ordinal”. 

— Trivially, P(0) holds true. 

— Suppose P(a) holds true. That is, suppose wa is an initial ordinal. We are required 
to show that wa+ = h(wa) is an initial ordinal. Let 3 € h(wa). It suffices to show 
that 6 £e h(wa). Recall that h(wa) is the least ordinal which is not equipotent to 
Wa or any of its elements. If @ € h(wa), then the ordinal 3 must be equipotent to 
some subset of wa E€ wa+. Then 3 cannot be equipotent to h(wa). This means that 
Wa+ = h(wa) is an initial ordinal. So P(at) holds true. 

— Suppose y is a limit ordinal and P(a) holds true for all a € y. That is, wa is an 
initial ordinal for all a € y. We are required to show that w, is an initial ordinal. 
Suppose not. Suppose wy ~e B for some 3 € wy = lub{wa : a € y}. Then 8 E= wa 
for some a € y. Since wy ~e P E= Wa E Wat E wy, we have a contradiction. So wy 
is an initial ordinal. 





By transfinite induction, every element of {wa : a € Ê} is an initial ordinal. 


We conclude that J = wo U {wa : a E€ Ø} precisely represents the class of all initial 
ordinals. 
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29.3 The Well-ordering theorem. 


We say that a set, S, is well-orderable if a well-ordering of S is known to exist. For ex- 
ample, we have shown that every countable set is well-orderable. On the other hand, 
other than the uncountable ordinals such as w1, w3 + 4,ws5, we have not witnessed a 
single uncountable well-orderable set. To show that a set, S, is well-orderable and to 
actually produce an algorithm that well-orders S$ are two different things. Of course, 
producing an algorithm that well-orders S is more useful than simply proving that S 
is well-orderable. But sometimes, the best we can hope for is to prove that a set S is 
well-orderable, even though we may be convinced that no algorithm that well-orders 
S will ever be found. 


It may come as a surprise to many readers to learn that in the set-theoretic universe 
governed by ZFC, all sets are well-orderable (including uncountable ones such as R). 
The statement “All sets are well-orderable” proved below is called the Well-ordering 
theorem or the Well-ordering principle. It is a direct consequence of the Axiom of 
choice. 


Since the Axiom of choice plays a fundamental role in the proof of the Well-ordering 
theorem, it will be useful to remind ourselves of what the Axiom of choice states. 


Axiom of choice: Every set of sets has a choice function. 


The Axiom of choice states that if Z = {X : X € P(S), X # Ø} is an infinite set of 
non-empty subsets of a set S, then there exists a function f : Z > U{X : X € A(S)} 
which maps each non-empty subset X of S to some element x € X. The key point is 
that there is no globally stated rule which states which element is chosen from each set 
in an infinite set of sets. A convenient (often used) example is to imagine an infinite 
set Z whose elements are “pairs of socks”. The Axiom of choice states that their 
exists a function f which associates to each pair one sock... even if, in each pair, one 
is indistinguishable from the other. 


Theorem 29.4 [AC] The Well-ordering theorem. Every set can be well-ordered. 
Proof: 


What we are given: That S is a non-empty set. 

What we are required to show: That S is well-orderable. 

To do this, it suffices to show that S is the one-to-one image of some ordinal number. 
Then, by invoking theorem 26.1, we can conclude that S' is well-orderable. 
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Case 1: Suppose S is a countable set. If S is finite, then it is the one-to-one image of 
some finite ordinal (natural number), and so S is well-orderable. If S is infinite, then 
it is the one-to-one image of N (a well-ordered set). Again, we must conclude that S is 
well-orderable. 


Case 2: Suppose that S is uncountable. We will recursively construct a function g : a —> S 
which maps some ordinal œ one-to-one onto S. If such a function g is shown to exist, then 
S is the one-to-one image of a well-ordered set and so can be declared to be well-orderable. 


Let A(S)* = A(S) — Ø. By the Axiom of choice there exists a function 
f: A(S)* — S which maps each element X € #(S)* to some element x € X C S. We 
recursively construct a function g : Ê — S as follows: 


g(0) = so €S, for an arbitrarily chosen element so in S 
g(1) sı = f(S — g[{O}]) = F(S — {so}) 
g(2) = s2= f(S—gl{0, 1}]) = f(S — {s0, 51}) 


glat) = sa+ = f(S —g[{0,1,2,...,a}]) = f(S — {50, 51,..-, Sa}), Va E€ C 
g(6) = sg=f(S—{g(a):a€ B}) = f(S— {sa : a € G}), V limit ordinals 8 


For each y in the domain, dom g, of g define g|ẹ as: gly = {(a, Sa) €g: a E7}. 
Claim: For each ordinal, y, such that y C domg, g|y is a one-to-one function on y. 


The proof of the claim is by transfinite induction. Let P(a@) denote the statement 
“gla : a@— S is a one-to-one function mapping a into S”. 

Inductive hypothesis: Suppose y C domg. Suppose P(a) holds true for all a € y, 
where y belongs to the domain of g. We are required to show that P(y) holds true. 
That is, we are required to show that g|, is one-to-one on y. 


Suppose (6, sg) and (u, s,,) are two elements in g|, such that 6 € u. Then 6 and 
u are elements of y. It suffices to show that sg # s,. Case 1: If y is a limit ordi- 
nal, then (8, sg) and (u, su) belong to g|,,+ C gly. By the inductive hypothesis, g|,,+ 
is one-to-one on ut; hence, sg # Sp. Case 2: Suppose y is a successor ordinal. If 
ut #7, then by the inductive hypothesis, g|,,+ is one-to-one on u and, since (8, sp) 
and (j1,8,) belong to g|,,+, then sg # s,. Suppose wt = y. By definition of g, 
glu) = Sp = f (8 = {sa : œa E u}). Since 8 € u, Bee S -— {Sa : œ E u}; hence, 
8g # f(S—{sq:a€ u}) = su. Then gl, is one-to-one on y. 

By transfinite induction, g|, is one-to-one on y, for all y C domg, as claimed. We 
conclude that g is one-to-one on dom g. 


Claim : The function g maps dom g onto S. That is, for every s € S, (œ, s) € g for some 
ordinal a. 
Let D denote the domain of g. To prove the claim it suffices to show that S — g|D] = Ø. 
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— We first show that D satisfies the initial segment property: If y C D, then gly = 
{(a,5a) € g:a€ y} Cg. Hence, a € y > (a, Sa) E g > a € D. Then D satisfies 
the initial segment property in @. 

— The domain D of g is a set: Since S is a set and g : D — S is a one-to-one function 
on D C @, the image g~![S] of the one-to-one function g~! : S —> @ must be a set 
(by the Axiom of replacement). 


— The domain D of g is an ordinal: We have shown that the domain D of g is a set in 
O which satisfies the initial segment property. Then D 4 @ and so there exists an 
ordinal ô such that D = Ss; = {a € @: a € ô} = ô (by theorem 26.3, also see page 
273). So we can write D = ô. 


— It now suffices to show that S— g[d] is empty. Suppose the set S — g[d] is non-empty. 
Then the choice function f maps the non-empty set S — g/d] to some element ss 
in S — g[d]. Then g(ô) = ss, which means that ô € D = ô, a contradiction. The 
source of the contradiction is the assumption that S — g[d] is non-empty. Then 
g|D] = g|] = S. So every element of S is in the image of the function g, as claimed. 


The function g : 6 — S mapping 6 one-to-one onto S = {sa : a € 6} then induces a 
well-ordering on S. We conclude that any non-empty set S is well-orderable. 


29.4 Defining the cardinal numbers. 


Overview — We are now set to formally define the sets we call “cardinal numbers”. 
First, we review some background material on how we came to discuss the concept of 
“cardinal numbers”. 


Given the class, .”, of all sets, we defined a relation ~e on Z as follows: S ~e T if 
and only if S and T are equipotent. The relation, ~e, was shown to be an equivalence 
relation on Z and so allows us to partition Z into a class of equivalence classes. 
Every set S € .Y then belongs to some equivalence class of pairwise equipotent sets. 
We represented an equivalence class containing a set S as [S]e = {T E€ %:T ~e S}. 
For example, [R]e and [N]. are equivalence classes containing all sets which are 
equipotent to R and N respectively. Once we had verified that A(N) ~e R, for 
example, we could write that [R]e = [A(N)]- where R and A(N) were simply dif- 
ferent representatives of the same equivalence class. When we first discussed the 
concept of “cardinal numbers”, the tools available at that time were insufficient to 
construct a class of sets whose elements could serve as representatives for each of 
the equipotence-induced equivalence classes.! So we postulated the existence of the 
class of cardinal numbers as follows (reproduced from Postulate 22.2): 


‘One may suggest that we could have defined, [S]-, as being the cardinal number of S. The problem 
with this is that we want cardinal numbers to be sets; the ~--equivalence class, [S]-, may be a proper class. 
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There exists a class € of sets such that 1) N C @, 2) every set S is equipo- 
tent with exactly one element KE C . 


We now have all the ingredients required to prove that a class of sets whose properties 
characterize the cardinal numbers exists in ZFC. 


Theorem 29.5 The class of all initial ordinals, % = wo U {wa : a E€ G}, satisfies the 
following properties: 


1. Every set S is equipotent to exactly one element in J. 


2. Two sets, S and T, are equipotent if and only if they are equipotent to the same 
element of J. 


3. The class, J, is €-linearly ordered. 
Proof: 


1) Let S be a set. By theorem 29.4, the set S has a well-ordering “<”. When equipped 
with this well-ordering, (S, <) is order isomorphic to some ordinal 8 (since every well- 
ordered set is order isomorphic to some ordinal number). We know there exists a unique 
initial ordinal wa which is equipotent with 8. Then wa is the unique initial ordinal which 
is equipotent to the set S. 


2) Suppose S and T are equipotent. Then by part 1) both S and T are equipotent to 
initial ordinals w, and we respectively. Since distinct initial ordinals cannot be equipo- 
tent, wy = Wa. Conversely, equipotent initial ordinals must be the same ordinal and so 
the elements of the class of all sets which are equipotent to the same initial ordinal are 
pairwise equipotent. 


3) Every pair of ordinals in @ are €-comparable. So every pair of ordinals in 4% C @ 
must be €-comparable. So J is €-linearly ordered. 


Definition 29.6 Cardinal numbers. An ordinal is called a cardinal number if and only if 
this ordinal is an initial ordinal. When the elements of J are viewed as cardinal numbers 
we represent J as €. 


29.5 Aleph notation. 


Although we could use the “wa” notation to represent the cardinal numbers it is cus- 


tomary to use the aleph notation, Ny. That is, we set No = wo, Ny = w1, No = wa, 
more generally, for any ordinal a, Na = Wa. For example, when we write the expres- 
sion X; we are thinking “the cardinal number N” rather than “the initial ordinal w1” 
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even though they represent the same entity. Since the initial ordinal, 8; = w1, is, by 
definition, “the least ordinal number which is not countable” it is the first uncount- 
able cardinal (ordinal). If we assume the Continuum hypothesis, then there exists no 
cardinal number, Na, such that |N| = Xo Ge Na >e |A(N)| = |R| = c (theorem 21.3); 
hence, Ni = w1 = c is the least ordinal which is not countable. 


Similarly, assuming the Generalized continuum hypothesis, the least uncountable or- 
dinal which is not equipotent to w is the cardinality of the set A(R), in which case 
we say “assuming GCH, the cardinality of A(R) is X2” where Nə is the least ordinal 
which is not equipotent to Xj. 


The class of all cardinal numbers @ can now be represented as 
C=NoU{Na i: QE O} 


where, for all ordinals a, Xa+ is the least ordinal which is not equipotent with Na and, 
for limit ordinals y, Xy = lub{Nq : a € y}. What is also new, is that the class of all 
infinite cardinal numbers is indexed by the ordinal numbers. 


Even though every cardinal number is an initial ordinal, the context usually allows 
us to determine whether we are referring to a set’s “cardinality” or “ordinality”. We 
normally refer to the “ordinality of a set” only if we have a specific (or hypothesized) 
well-ordering of that set in mind. In the following tables, we describe how the car- 
dinality, and ordinality of a set are perceived when we are assuming the Generalized 
continuum hypothesis and when we are not. 


Without assuming CH nor GCH (in the presence of the Axiom of choice): 








{a,b,c} 3 3 3 
Notandard No Wo Wo 
Wo + Deuwelicondarad No Wo Wo + 3 
{1, 2} x Niexico No Wo wo2 
N x Niexico No wo wowo 
Wy Ny Wy Wy 
R cH De RS Ry Wa 
F(R) 2a = Ng > Nati we 
P(F(R)) 23e = N, > Noi Wy 


P(P(P(R))) | X =R; > Ri By 
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Note that it is the Well-ordering theorem (itself a consequence of the Axiom of choice) 
which guarantees that for every ordinal, 3, 2%¢ = |238| = |A(Ng)| is equal to some 
(initial) ordinal number wy = Xj > Ng4i1. That is, the Well-ordering theorem states 
that P (Ng) ~e Ny, for some cardinal Xy > Nei. Without the Axiom of choice, 
the set, A(Xg), may not be well-orderable in which case it would not necessarily be 
equipotent to some ordinal number. 


Assuming GCH, cardinal numbers are more clearly defined: 








Set S cardinality of S | initial ordinal of S ods 
{a,b,c} 3 3 3 
Niiandara No wo wo 

Wo + 3e-well-ordered No Wo wo +3 
{1, 2} x Mexico No Wo wo2 
N x Niexico Ro wo wowo 
R 20 = Ny Wy 
RxR Ny Wy 
P(R) 281 = No w2 
P (P (R)) 2X2 = N3 W3 
P(P(PR))) 25s = Ny w4 
Na Wa 


What does this say about GCH? The Axiom of choice guarantees that every set can be 
well-ordered and so all sets can be ranked on an “equipotence based scale @ of sets” 
called the cardinal numbers. This means that every set is associated to a uniquely 
specified ordinal number (cardinal number) on this scale of ordinals. We make the 
following universe comparisons. 


In the ZFC — universe: For any infinite set S such that |S| = 8y, 2° ~e P(S) ~e 
Na for some a > y. The value of a is guaranteed to exist, but cannot be determined. 
The value of a is simply assumed to be equal to some ordinal greater than or equal 
toy+1. 


In the ZFC + GCH- universe: If S is any infinite set such that |S| = NX}, then 
25 ve P(S) ~e Ny41. The cardinality of the set 25 ~e P(S) is the least cardinal 
number (on the equipotence based scale) which is larger than 8. The axiom GCH 
limits the size of power sets Y(S) relative to the size of S. 


In the ZFC + CH— universe: For any infinite set S such that |S| = Ny > No, 
25 we P(S) ~e Na for some a > y. The value of a is guaranteed to exist, but 
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29.6 


cannot be determined. It is equal to some ordinal greater than or equal to y + 1. 
But if |S| = No, 2° ~e P(S) is the immediate successor cardinal, N1, of No. So 
CH only limits the the size of 25°. 


In the ZFC + ~CH— universe: For any infinite set S such that |S| = Ny > No, 
25 we P(S) ~ve Na for some a > y. The value of a is guaranteed to exist, but 
cannot be determined. It is equal to some ordinal greater than or equal to y + 1. 
But if |S| = No, 2° ~e P(S) is not the immediate successor cardinal, N1, of No. 
That is, 280 > N4. 


Ranking the elements of the class Z of all sets in ZFC with —,. Suppose S and T 
are two infinite sets in .Y. If S and T are equipotent, then they can be viewed as being 
the “same size” (just like a set of five squirrels and a set of five submarines are viewed 
as being the same size, in spite of the fact that squirrels and submarines are entirely 
different types of objects). Suppose now that S and T are not equipotent. We wonder 
whether one of these two infinite sets is necessarily embedded in the other. We show 
that it must be the case. Say the relation <s well-orders S and <r well-orders T. 
(The Well-ordering theorem guarantees that such well-orderings exist for each of these 
two sets.) Suppose a =*"S and 8 =""*T. Now a and 8 cannot be the same ordinal 
number for if they were, then S and T would be equipotent. Then, either a € 8 or 
GB E€ a. Suppose, without loss of generality, that œ € B. The set S is order isomorphic 
to an initial segment of T. It follows that that S —, T. We have shown that any 
pair of non-equipotent sets S and T are ~,-comparable. Note that comparing sizes 
of sets in this way would not be possible without the Well-ordering theorem (which 
follows from the Axiom of choice). 


Finally, since the class, {Na : a € Ø}, of all infinite cardinal numbers is indexed by 
the elements of @, no two of which are equipotent, we can then say that our universe 
of sets contains as many different infinite set sizes as there are ordinals! 


Some consequences of the cardinal number definition, @ = S. 


Knowing that the cardinal numbers are in fact initial ordinals (which in turn are known 
to be limit ordinals) will allow us to prove certain properties of cardinal numbers, 
a task which was beyond our reach unless we knew more about the nature of the 
those sets we call “cardinal numbers”. We first remind ourselves of the definition 
of the “product of two cardinal numbers”: If « and A are cardinal numbers then 
k X A= |K x L| where « =|K| and à = |L]. 


We have already provided a few properties of cardinal number multiplication in the- 
orem 23.4. In particular, we showed that No x No = No. We can now move a step 
further by showing that for any ordinal a, Na X Na = Na. We begin with the following 
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lemma. 


Lemma 29.7 For any infinite cardinal number, «x, define a relation <, on «K x « as follows: 
For pairs, (a, 3) and (y, Y), of ordered pairs in «K x k, 


aUBeyuw 
or 


(a, B) <« (y, Y) whenever BeEwwhenaUlB=yUw 


or 
a E€ y when a U 8B = y Uy and 8 = 4% 


Then <, well-orders k x kK. 


Proof: 


The relation <, is easily verified (on a case by case basis) to be transitive on « x kK. It 


is also easily verified that if (a, 3) <x (y, Y) and (y, Y) Z» (a, B), (a, 8) = (y, Y) and so 
<, linearly orders «K x k. 


Claim: Every non-empty subset of k x k contains a least ordinal pair element with re- 
spect to <x. 


Proof of claim: Let U be a non-empty subset of « x k. Consider the set 
A={d: 6=aU@ for some (a, 3) E€ U} 


If (u,y) E€ U, then uU y E€ A, so the set A is non-empty. Since A is a subset of the 
well-ordered class @, there exists an €-least element wa € A. Let Va = UN {(a, 2): 
aU B= pa}. Let 

B={6: (a,6) € Va} 


The set B is non-empty and so there exists an €-least element ug € B. Let Vg = 
Va N {(a, 8): 8 = ug}. Let 


C={6: (ð, uB) E€ VB} 


The set C is non-empty and so there exists an €-least element uc € C. See that the 
element (uc, uB) is the <,-least element of U. 
Then every non-empty subset U of k x k has an <,-least element, as claimed. 


Then <, well-orders the ordinal pairs in the set «K x kK. 


Part VIII: Ordinal numbers 313 


For example, 


(Ni, No+ 5) <x (No, R3) since NU (No + 5) = (No + 5) E N3 = Na U N3 
(Xz, Ng + 9) Sy (Xz, Ny) since N7U (N3 + 9) = N7 = Ny UN, and (N3 + 9) EN, 
(N3, Nz) <x (Xz, R7) since N3 U N7 = N7 = N7 U Ny and N7 E Ny and N3 € N7 


The above lemma shows that for any infinite cardinal number «, K X « is well-orderable 
and so any product, k x «, of an infinite cardinal « with itself is order isomorphic to 
some ordinal number ju. 


Theorem 29.8 [AC] For any ordinal a, Na X Na = Na. 
Proof: 


Since we have previously shown that the product of infinite countable sets is countable, 
then No X No = No. We will prove the general statement by transfinite induction. 


Inductive hypothesis: Suppose Na x Na = Na for all a € 6 for some ordinal 6. We are 
required so show that Ns x Ns = Ns. By the lemma, <, well-orders Ns x Ns and so Ns x Ns 
is order isomorphic to some ordinal ys. It will suffice to show that Ns = u. Since 


Ng ~e Ng X {1} Ce Ng X NG ~e H 


then Ns €= u. To show that Ns = u, it suffices to show that Ns € u is impossible. 
Claim: Xs € u is impossible. Suppose Ns € u. Suppose f : u — Ns x Ns is the order 
isomorphism mapping u onto Ns x Ns. Then, since Ns € u, 


FNs) = (ao, bo) E€ F(u) = Na x Na 


Since ag and ĝo are ordinals in Ns, then ap U Jp is an ordinal in Ns, itself a limit ordinal. 
So ao U 6o + 1 € No. Since Ng is an initial ordinal, |a U 8 + 1| € Xs. By our induction 
hypothesis, 

aU 8B +1|x]|au b +1] =]|au Bt 1] E Ns 


A E A Soop ies E E ake 
[8s] = { (a, 8) E Na x Ns : (a, B) <x (ao, Bo)} C (a0 U Bo + 1) x (ao U Bo + 1) 
We then have: 
Ns Se T EE 41) 
~e |(aUG+1)x (aUBt+1)| 


laUB+1|x jaUBt1) 223) 
ja U b + 1| € Ns (Inductive hypothesis) 
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We have obtained the contradiction Ns —>e Ns. Then Ns € u is impossible, as claimed. 
So u = Ns. That is, y x u = Ns x Nsg = No = p. 
By transfinite induction, for any ordinal a, Na X Na = Na, as required. 


Corollary 29.9 Let k be an infinite cardinal and {Ag : a € 8} be a set of non-empty 
sets indexed by the elements of the ordinal 6 €= «k where |A,| €= & for all a € 8. Then 
|U{Aa: a E B}| E= k. 


Proof: 


We are given that « is an infinite cardinal number, |8| €= « and for each a € 8, the 
cardinality of |A,| E= K, Aa is non-empty. 
Since, for each a € 8, |Aa| E= k, then for each a € 8, there exists an isomorphism, 
fa: Aa `> K, Mapping Aa one-to-one into the initial ordinal x. 
For each x € Ag, let d(x) = least of {a € B: x E€ Ag}. 
We define the function h : U{Aq : a € B} > k x k as follows: 
h(x) = (6(@), foca)(@)) € K x [As] C= KX K 
Claim: That h is one-to-one on U{ Aa : a E€ 8}. 


h(x) =h(y) => (ê), fsc (2)) = Gl), Fay) 


=> 0(x) = ð(y) 
=> fiw lt) = fia (y) = fsa (Y) 
SS Y (Since fa : Aa —> «k is an isomorphism for all a.) 


So h maps U{ Aa : a E 3} one-to-one into & x K, as claimed. 


Then 
[U{Aa:@e P} = [A[U{Aa:a€ PH] 
E=- |kxk] 
= |r] x Iai 
= KXK 


= K (By the previous theorem.) 


So |U {Ag: a E B}| E= k, as required. 


Corollary 29.10 For any infinite cardinal number k, k" = 2*. 
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Proof: 


Since 2 < k, 2" < K" (by theorem 24.3). 
It now suffices to show that 6” < 2": 


Let f € x". Then f is function which maps « into K. Then f C k x k = k (by theorem 
29.8). Then f € A(«) where |A(K)| = 2". Then K! < 2%, as claimed. 


We conclude that k! = 2%. 


29.7 Properties of some large cardinal numbers. 


We have seen that the class of all infinite cardinal numbers, {Na : a € Ê}, is the class 
of all initial ordinals {wa : w E€ Ê}. Recall that limit ordinals are those ordinals y 
which do not contain a maximal element (since the maximal element of an ordinal, 
if it has one, is its immediate predecessor). Equivalently, limit ordinals are those 
ordinals, y, which satisfy the properties, lub(y) = y and y = U{a: a € y} (27.15). 
It is interesting to reflect on properties possessed by some cardinals perceived to be 
considerably larger in size then others. We must proceed cautiously, since the ele- 
ments of the class of all cardinal numbers are indexed by the elements of the class of 
all ordinal numbers (a few of which are themselves cardinal numbers). We start by 
providing the following definitions. 


Definition 29.11 


a) 


We say that an infinite cardinal number, N4, is a successor cardinal if the index, y, has 
an immediate predecessor (i.e., y = G+ 1, for some 8). The expression, Na+ = Na+, 
denotes a successor cardinal. We say that an infinite cardinal number, Ny, is a limit 
cardinal if y is a limit ordinal (i.e., y = lub{a:a € y}). 


We say that a limit cardinal N, is a strong limit cardinal if N} is uncountable and 
{2% : a ey} Cry. 


Limit cardinals should not be confused with limit ordinals. All infinite cardinals, Na, 
are limit ordinals. But not all infinite ordinals are limit cardinals. See that 





Ny = Rop = Not 
No = Nip = N+ 
Nioo = Ngg+1 = Ngg+ 


are successor cardinals and so are not limit cardinals, even though they are all limit 
ordinals. A limit cardinal number is a cardinal number which is not attainable from 
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below by a finite sequence of successor cardinals. Limit cardinals can be easily recog- 
nized as being those infinite cardinals, X4, whose index, y, is a limit ordinal. We see 
that the first four limit cardinals are Xo, Nuo, Nwo-+wo: Nwotwotwo since 


n E€ No 
Rn E Nay 


n+t1ENg 
Rn+1 € Roo 


Nuotn € Nwotwo Rootn+i € Ruiotwo 


UUL 








Ruotwo+n € Nuotwotwo Nuotwotn+1 € Ruotwotwo 

On strong limit cardinal numbers. The notion of a strong limit cardinal is a bit more 
difficult to grasp. Since wọ is the first limit ordinal, then the first uncountable limit 
cardinal is Xu). Is Nuo “strong”? The answer to this question depends on which 
assumptions are made. Consider, 5 € wo. Is 255 necessarily an element of Roo? For 
5 € wo, 235 > Ng (since |255] = |P(Ns5)| > N5, while Ng = we is the least ordinal which 
is not order isomorphic to ws). But does 25 belong to Ruo? It is possible, but the 
given inequality doesn’t guarantee that 235 € Nu However, if we assume GCH, then 
definitely, 285 = Ng € Nwo. 


This example suggests that when assuming GCH, determining when a limit cardinal 
is “strong” is pretty straightforward. We prove the following result. 


Theorem 29.12 [GCH] Every uncountable limit cardinal is a strong limit cardinal. 


Proof: 
Suppose ÑN, is an uncountable limit cardinal, and suppose a € y. Since Ny is a limit 


cardinal, then, by definition, Na; E€ Ny. Assuming GCH, we have Wa = Nayi E Ny. 
We have shown that a € y > 2e € Ny. So Ny, is a strong limit cardinal, as required. 


But what if we don’t assume GCH? Are there any strong limit cardinals in the ZFC- 
universe? Interestingly enough, we can show that in the presence of the Axiom of 
choice, there must be. 


Theorem 29.13 [AC] There exists a strong limit cardinal number. 


Proof: 
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Let @* = {Na : a E Ê, a Æ 0} denote the class of all uncountable infinite cardinals. 
Since we are assuming the Axiom of choice every set is well-orderable and so, for each 
ordinal a, A(Xq) is well-orderable and so A(X.) has a cardinality, |Z (Xa)|. By 
theorem 20.12, 2e ~e P(Na); hence, 2%« is well-orderable and so has a cardinality: 
[25a] = [2| Rel = Wa = |P(Nq)| E F*. 
We define the function g : @* — @* as follows: 

g(a) = ane 
Let U = {rpn : n € wo} C G* be recursively defined as follows: 


ko = No 

kı = g(ko) = 2% 

k2 = gļkı)=2" 

k3 = g(kK2) = 2° 
Katy = glkn)=2" 


Then {rn : n € wo} is a countable (strictly) increasing sequence of cardinals. It follows 
that y = U{kn : n € wo} is an infinite ordinal which is the least upper bound of the set 
U (theorem 27.12). 


Claim: The ordinal y is an infinite cardinal number. 


It suffices to show that y is an initial ordinal. Suppose ĝ is an ordinal in y. It suffices 
to show that |8| < |y]. Since 8 € y, then, for some m, 


BE km = 2°71 E 2°" = Kyi EY 


Then |8| < km < Km+1 < |y|. Since the cardinality of 8 is less than |y|, then y is an 
(infinite) initial ordinal, and therefore is cardinal number, as claimed. 


Since y is an infinite cardinal number, there is an ordinal «x such that 
Re = 
Claim: The cardinal &, is a strong limit cardinal. 
Suppose a € «K. It suffices to show that 25% € N. Then 
Na E Nk = Y = UfKn : n E€ wo} 
Then Na € Km for some m € wy. It follows that 
Nat E= Wa c fm — Km41 E Ñk (25m =xm41 by construction of {xn : n € wo}) 


Since Na+1 € Nk, Nx is a limit cardinal. By definition, 2e € N, implies, Nẹ is a strong 
limit cardinal number. 
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We have shown that strong limit cardinals exist in a ZFC-universe. Unfortunately the 
proof cannot tell us how to construct one or what it looks like. Strong limit cardinals 
are completely elusive. But, ethereal as they may be, we can safely make the assump- 
tion that they exist. It is important to see that GCH plays no role in the above proof. 
In a ZFC+GCH-universe strong limit cardinals are everywhere and easily seen since 
they are the limit cardinals. 


We now present other categories of cardinal numbers. 


Definition 29.14 


a) 


We say that an infinite cardinal, N4, is a singular cardinal number if N4 is the least 
upper bound of a strictly increasing sequence of ordinals, {a, : k € 3}, indexed by 
the elements of some ordinal ( in X4. 


An infinite cardinal N} is said to be a regular cardinal number if it is not a singular 
cardinal number. That is, there does not exist an ordinal, 3, in N} such that N}, = 
lub{a, : K € B}. 


The following limit cardinals are examples of singular cardinals: 


Noo = lub{X,:n € wo}, where wo E Ruo 
Ruotuwo = lub{Ruo+n : N E wo}, where wo E Nuotwo 
Nuos = lub{Nuo2+n : N E wo}, where wo E Nuys 
RNuowo = lub{Noon : Nn Ewo}, where wo E Rugwo 


Nu. = lub{Na:Qa Ewi}, where wi = N1 E Nu, 
The above examples suggest that there can be no upper bound to the class of singular 
cardinals since if y is any ordinal larger than wo, 


Nyt = lub{Ry4n : Nn E wo}, where wo € Ry 


How easy is it to find (infinite) regular cardinal numbers? Our first example of a 
regular cardinal is a simple one. 


Example: The countable limit cardinal No is a regular cardinal since, for any non-zero 
m E No, lub{n:n em} =m—-1#No. 


Determining whether a given cardinal number is regular or singular case by case can 
be tedious. It will be helpful to determine a few principles that will help us distinguish 
those limit cardinals which are regular from those that are singular. 
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Theorem 29.15 Every infinite successor cardinal, Na+ = Na41, is a regular cardinal. 


Proof: 


Suppose N,+ is a successor cardinal number. To prove that it is regular we will suppose 
that it is singular and show how this leads to a contradiction. Suppose that @ € Na+ such 





that Na+ = lub{a, : k E€ 8} where {a, : k € 8} is an increasing sequence of ordinals. 
Then |8| €= Ng. Since Na+ is a limit ordinal, X,+ = Ufa, : k € B} (theorem 27.12). 
Now Na+ = Na+1 is the least ordinal whose cardinality is larger than Ng. We then have 
lak] E= Na for all K € £. 
Then 
Nat = [Ro+| 
= |Ufax: KE p) 
E= Na (IB| E= Xa, |ax| E= Na, and by corollary 29.9.) 


So Na+ E= Na, a contradiction. The source of our contradiction is our assumption that 
Na+ is singular. Hence, Xa+ must be regular, as required. 


The above theorem provides us with a tool for constructing infinite regular cardinal 
numbers. For example, the following are infinite regular cardinals: 








Ny = No+ 
Ny 41 = Nyt 
NXg + = Ret 


Theorem 29.16 Let y be an infinite cardinal number. If y is a singular cardinal, then the 
cardinal number N, is a singular cardinal. 


Proof: 


Given: That y is a singular cardinal. That is, there exists an ordinal 6 € y such that 
y= lub{a, :« € 3} where {a, : « E€ 8} is an increasing sequence of ordinals. 

Required to show: That N} is a singular cardinal. 

Since y is declared to be an infinite cardinal, it is a limit ordinal. Then N} is a limit 
cardinal. 

Claim: That Ry = lub{Na, : K E P}: 

It suffices to show that lub{Na, : K E B} E= Ny and Ny E= lub{Ny, : K EP}. 
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— First note that since {a, : k € B} is increasing, 
[ax E€ 7] > Ran E Ny] for all k € 8 


Then lub{Na, : K € B} E= N4. 

— Let u € Ny. Then |u| E= u E Ny. Let 6 be such that Ns = |u| € Ny. Since 
N, is a limit cardinal, u E€ Ns+1 € Ny; hence, ô +1 € y. Since y is a limit 
ordinal y = U{a, : k € B} and so Ô +1 E a,,, for some ko € 8. Then 
u E Niti E Nang E= lub{Na, : K € GB}. It follows that y € lub{Na, : K € P}; 
hence, Xy €= lub{Ng, : K E P} 

We conclude that 8, = lub{Na, : £ € 3}, as claimed. 

Since 3 €= Ng (by theorem 28.14) and Ng € N}, 6 E Ny. So NX, is singular, as required. 





The above theorem allows us to recursively construct infinite sequences of singular 
cardinals. For example, since we have shown that the cardinal Xag is singular, then 
the set 


pte 


Ruo Ruo Rew? N Rien, 
0 


contains only singular cardinals. 


Inaccessible cardinal numbers. 


The examples we provided of singular cardinals were all limit cardinals. We then 
showed that No and every successor cardinal is a regular cardinal. However, uncount- 
able regular limit cardinal appear to be elusive. One may be tempted to conclude 
that no uncountable limit cardinal can be regular. But none of the ideas expressed 
above allows us to arrive at such a conclusion. If a regular limit cardinal exists 
what would it look like? Suppose Ng is an uncountable regular limit cardinal. Since 
{Ni, No, N3, N4,,.-., } are all successor cardinals, Nuo E= Ng. Now 8 must be a limit 
ordinal; hence, we can argue (as in the proof of the theorem immediately above) that 
Ng = lub{X.: a € 8}. If 8 € Ng, then the fact that Ng = lub{Ny : a € 8} would 
imply, by definition, that Ng is singular. Since Ng is regular, it must be the case that 
B Z Ng. Hence, if Ng is an uncountable regular limit cardinal, it must be the case that 
B = Ng (part b) of theorem 28.14). 


Uncountable regular limit cardinals (if they exist) are referred to as “inaccessible” 
cardinals, possibly because, if some exist, they are difficult to find. 


Definition 29.17 A regular cardinal number which is a limit cardinal is called an inacces- 
sible cardinal. A regular cardinal number which is a strong limit cardinal is called a strongly 
inaccessible cardinal. 
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29.8 


Recall that a strong limit cardinal Ng is a limit cardinal which satisfies the property 
“k E Na => 2" © Na”. We have shown that strong limit cardinals exist in a ZFC- 
universe. We have also seen that in a universe governed by ZFC + GCH, strong limit 
cardinals are the same as infinite limit cardinals. So if we assume GCH, all weakly 
inaccessible cardinals are strongly inaccessible. 


Interestingly enough, it has been shown that the existence of inaccessible cardinals 
cannot be proven from ZFC. This means that if we assume that the existence of in- 
accessible cardinals, this will not lead to a contradiction that is not already there. 
Those mathematicians whose mathematical statements depend on the existence of 
inaccessible cardinals are roaming in the “ZFC + J inaccessible cardinals” —universe, 
one which is larger than the one which is governed by ZFC. 





Cofinality of a cardinal number 


Suppose A and B are two subsets of a partially ordered set (P,<). We say that the 
subset B is cofinal in A if for any a € A there exists b € B such a < b. A set is always 
cofinal in itself. 


Examples: 

— The set B = {0,2,4,6,...} is a cofinal subset of No, since for any natural number 
n € No there exist 2n € B such that n < 2n € B. In this case we see that No has a 
cofinal subset B such that |B] = |No| = No. 

— The set of ordinals wə + 3 is not cofinal in wo + 4 since wo + 3 € wo + 4 but there 
is no a E€ w2 + 3 such that w2 + 3 <a. In fact, the only subset of wọ + 3 which is 
cofinal in wọ +3 is wọ + 3 itself. We generalize: Every successor ordinal a has only 
a as cofinal subset. 

— We can show, on the other hand, that the set N7 is not cofinal in Ng: We know that 
Ng is the least ordinal whose cardinality is larger than N7. Then N7 € N7U{N7} € Ng 
(since the cardinality of Ng is strictly larger than the cardinality of N7U{N7}). Since 
k E€ X7 U{N7} for all k € N7, N7 cannot be cofinal in Xs. 

— In fact, it seems that all cofinal subsets of Ng must have cardinality Ng. We verify 
this hypothesis. Let us suppose that B is a cofinal subset of Ng such that |B| < Xz. 
Then we can index the elements of B = {ba : a < B € Ng} where |8| < N7 (noting 
that any proper subset B of Ng has cardinality Ny or less). Now if lub(B) = y < Ns 
we can choose y + 1 € Ng where ba < y+ 1 for all ba E€ B. Since this would 
contradict the fact that B is cofinal in Ng, we must have that lub(B) = Xs. By 
corollary 29.9, |lub(B)| = | U {ba : a < G}| < Xz. We have a contradiction. Hence 
cofinal subsets of Xg must have cardinality Ng. 

— If y is an ordinal then a subset B of y is cofinal in y only if lub(B) = U{b: b € 
B} = y (for if lub(B) = K < y then b < k +1 < y for all b € B). 
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We see that studying the cardinality of cofinal subsets of cardinals might allow us 
to distinguish even further certain types of cardinals from others. This leads us the 
definition of the cofinality of cardinal numbers. 


Definition 29.18 Let N, be an infinite cardinal. We say the cofinality of ÑX} is @ and 
write cf(Xj) = £ if 2 is the least ordinal in N, which indexes an increasing set of ordinals 
{0a : œ < B} such that Ny = lub{@, : a < 8}. If no such £ exists in N} then we say the 
cofinality, cf(N,), is Ny and write cf(N,y) = Ny. 


We see that, for an infinite cardinal number N4, cf(%,) = 6 < N4 if and only if there is 
a smallest set of increasing ordinals {0a : a < 3} which is cofinal in X, (in the sense of 
the definition of “cofinal subset” described above). Given this definition of “cofinality 
of a cardinal number” we can now restate our definition of “singular” and “regular” 
cardinal numbers as follows: 


An infinite cardinal number N, is said to be a singular cardinal if only if 
cf(N,) < Ry. If cf(&,) = Ny then Ny is said to be a regular cardinal. 


So singular cardinals are those infinite cardinals that have a proper cofinal subset. We 
showed above that every successor cardinal X, is such that cf(%,) = Ny and so are 
regular cardinals. We then defined the inaccessible cardinals as being those cardinals 
Ny which are both limit cardinals and satisfy cf(N ) = Xy. 


Whenever N4 is singular the definition of cofinality cf(,) of Xy is declared to be 
the smallest ordinal 3 which satisfies a particular property. We will show that the 
cofinality, cf(%,), of a any cardinal X} must also be a cardinal. 


Theorem 29.19 The cofinality cf(&,) of an infinite cardinal number N4 is a cardinal num- 
ber. Hence cf(N}) is the smallest cardinality of all sets which are cofinal in X4. 


Proof: 


The statement is obviously true if X4 is a regular cardinal (that is, if cf(®,) = N4). Suppose 
that XN, is a singular cardinal. Then cf(X,) = 6 < Ny. Then Ny = lub{0, : a < 8} for 
some strictly increasing function h : 6 — Ny such that h(a) = 0a. We are required to 
show that 8 is a cardinal number. Suppose not. Suppose N; = |6| = |cf(Ny)| < 8. Then 
the ordinal &) is equipotent to 8. Let f :&), — 8 be a one-to-one function mapping N) 
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onto 8. Let g: N) — 8 be defined as: 


to = 9(0) = f(0) 

Hi=g(1) = elb{f(1), f(2),..., all of which > g(0)*} > g(0) 

u2 =9(2) = glb{f(2), f(3),..., all of which > g(1)*} > g(1) 

us =9(3) = glb{f(3), f(4),..., all of which > g(2)*} > g(2) 

Ln =g(n) = glb{f(n), f(n4+1),..., all of which > g(n—1)*} > g(n) 
uk =g(k) = glb{f(«), f(k+1),..., all of which > g(5)t when ô < K} 


Then uk} = glk +1) > g(k) = we for all k € Ny; hence g : Nà — £ is strictly 
increasing on its domain. Since f is onto 8, for each a € 8, there exists ka E Ny 
such that g(ka) = a € B. Then N} = lub{0a : a < 6} = lub{u, : k < Na} where 
Lx = t(k) = h(g(K)). This contradicts the fact that 8 is the least ordinal such that Xj = 
lub{@, : a < 3}. We must conclude that, for any cardinal N4}, cf(N}) is a cardinal number. 


Hence cofinalities of cardinal numbers are cardinal numbers. We can now show that 
the cofinality of a cardinal number must be a regular cardinal. 


Theorem 29.20 The cofinality cf(y) of an infinite cardinal number ¢ is a regular cardinal. 
Proof: 
Let y be a cardinal and ô = cf(y). To show that ô is regular it suffices to show that 


cf(6) = 6. We know that cf(d) < ô. It will then suffice to show that cf(d) £ 6. Let 8 = 
cf(d) and suppose 8 < 6. We will show this will lead to a contradiction. Then 


ô= lub{tg:a < 6} 


where Ha = f(a) € 6 for some strictly increasing function f : 8 — 6. Since 6 = cf(y) 
then 

p= lubl@,:n <ô} 
where 0, = g(K) € y for some strictly increasing function g : 6 — y. Then the function 
gof : 3 — vy is a strictly increasing function which maps £ into y. 
So y = lub{6, : k < 6} = lub{¢a : a < B} where ġa = t(a) = g(f(a)). Since 8 < ô 
this contradicts the fact that 6 is the least ordinal which allows the cofinality condition 
to hold true. Then cf(d) £ ô. Hence cf(d) = cf(cf(y)) = cf(y) = 6. So cf(y) is a regular 
cardinal. 
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We have seen that for infinite cardinals «x, it occurs that xò is sometimes equal to 
k (for example (2%) = 2*** = 2%). The following statement shows a relationship 
between « and à which guarantees that «k < Kò. 


Theorem 29.21 If « is an infinite cardinal and cf(«) < À, then k < kò. 


Proof: 


We are given that « is infinite and cf(x) < A. We are required to show that «x < Kò. 
Case 1: We first consider the case where « is a regular cardinal. In this case we have 
k = cf(k). From the hypothesis we obtain s < A. By 24.3, k" < kò. Since k < 2" = K" 
(by 29.10), then k < Kò, as required. 

Case 2: We now consider the case where « is a singular cardinal. That is, suppose 
cf(k) = yp < rk where y < à. Then & = lub{ ua : a E€ y} where {ua : a E€ ¢} is a strictly 
increasing set. Then «s = U{ ua : a € vy} (since « is a limit ordinal). 


We claim U{ ua : a € Y} < K?: 
Suppose not. That is, suppose K? < U{ Ha : a € Y}. Then there exists a function f : kK? > 
U{ ua : @ € p} mapping K? one-to-one into Uffig : a E€ Y}. Then f7} is well-defined and 


onto K?. Notice that if k € k? then k can be expressed as k = (ko, K1, K2,---;Ka;+--;) 
where Ka E€ k when a € y. Then Kk? can be expressed in the form of the product Hageyk. 
We define the map mq : Ilgegk  K aS Tq(Ko, K1, K2, +++, Kgs +++) = Kq- 


For q € p, fo [ug] = {k E K? : f(k) € uq < K}. For each q € y, let 
Ka = malf [ha] C 6 


Next, for each q € y, choose xy € k— K; (Choice!). Then use the selected xg’s to construct 
the point 
£ = (£0, £1, Pires Zq- ) E Harcok 


There must exist some point y € U{ Ha : a € y} such that f~'(y) = x. Then there exists 
some q € y such that y € fg, hence f(y) € f [bg]. Thus ma(ft(y)) = q(x) = tq € 
Tal f~ [Mg] = Kq, contradicting xy € k — Kg for all q € p. The source of the contradiction 
is our assumption that K? < U{ Ha : a € y}. Then U{ ua : a € Y} < K’, as claimed. 


We then have k = U{ Ha : a € Y} < K? < kò (since p < A and by 24.3). Then « < Kẹ, as 
required. 


Concepts review: 


1. What is a well-orderable set? How is it different from a well-ordered set? 


A. 
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2. What can be said about those sets that are the one-to-one image of an ordinal number. 
3. Is it true that any well-ordered set, no matter how large, is order isomorphic to some 
ordinal number? 

4. What does it mean to say that the well-ordered set S has order type (ordinality) a? 
5. What does the Well-ordering theorem say? 
6. The Well-ordering theorem is a consequence of which fundamental ZFC-axiom? 
7. What is an initial ordinal? Are natural numbers initial ordinals? Why? 
8. What is the least uncountable initial ordinal? How is it obtained? 
9. Are all ordinals in wo U {wa : a € Ê} initial ordinals? 

10. Define the “cardinal numbers”. 

11. If we assume the Continuum hypothesis, what is the cardinality of R? 

12. If we do not assume the Continuum hypothesis, what is the cardinality of R? 

13. What is the cofinality of a cardinal number? 

14. Define singular and regular cardinal number. 

15. What kind of cardinal is said to be inaccessible? 

16. Is the cofinality of a cardinal number necessarily a cardinal number? 

17. Give examples of regular cardinals. 

18. Give examples of singular cardinals. 

19. What is a limit cardinal? 

20. What is a successor cardinal? 

EXERCISES 


1. Consider the set of ordinals defined as S = U{wn : n € N,n > 2}. 
a) Is S = @? Why? 
b) 
c) If S is an ordinal number, is it a limit ordinal number? 
d) What is the least upper bound of S$? 


Is S is an ordinal number? 
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2. List a few possible order types of a set of cardinality No. 


A Ww 


CON OQ Oo 


10) 


11. 
12. 


13. 
14. 


. What is the cardinality of a set whose ordinality is w3 + w?? 
. Use the appropriate symbols to represent the ordinality, the associated initial ordinal 


and the cardinality of the set S = NU {N} (ordered in the usual way). 


. Does the Cantor set have a well-ordering? 

. If we assume CH, what is the least ordinal that is not equipotent to R? 
. What is the least ordinal which does not belong to -4? 

. What is the least limit ordinal that does not belong to J? 


. Let S = {1, 2,4, 8, 32,64}. Define the order relation < on S as follows: a < b if and 


only if a divides b. 

a) Is < a well-defined order relation on S? 

b) Is < a well-ordering of S? 

c) Express (S,<) as a an element of P(S) x P(S x S) by explicitly exhibiting all 
of its elements. 


Let S = {aa : a E wo +w} be a set whose elements are defined as follows: 
2 — zH when &€wo 
do = 2 when Q=u 


24+ yay when wo €aQ 


where flwo +n) =n. 

a) Are the elements of the set S well-defined? 

b) Is the ordering induced on the elements of S by the index set wọ + w a well- 
ordering? 

c) If the elements of S are assumed to be well-ordered by the index set, what is the 
least element of the set S with respect to this ordering? 

d) What is the least upper bound (supremum) of the set {aq : a E€ w} with respect 
to the ordering defined by the index set. 


Show that if S is a class of ordinals, then the least upper bound of S is U{a € @: 
a ES}. 


Is it true that given any two sets S and T, either S is embeddable in T or T is 
embeddable in S? Why? 


Is the class J of all initial ordinals an initial segment of ©? Why? 
Is the class Y of all initial ordinals a transitive class? 
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More on axioms: Choice, 
regularity and Martin’s axiom 
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30 / Axiom of choice 


30.1 


30.2 


Summary. In this section we prove that the Axiom of choice is equivalent to 
the Well-ordering principle. We provide a few mathematical statements whose 
proof requires the Axiom of choice. Finally we present Zorn’s lemma and show 
that it is equivalent to the Axiom of choice. A proof of the fact that every vector 
space has a basis is given by invoking Zorn’s lemma. 


Introduction. 


We have seen that there is a subtle difference between proving that a mathematical 
object “exists” and actually “constructing” or “exhibiting” this mathematical object. 
If we can construct a mathematical object, then this can serve as a valid proof that 
this object exists. But some mathematical objects can be proven to exist without 
ever being able to provide a concrete example of this object or providing a method 
to construct it. For example, we have proved in the Well-ordering theorem (a con- 
sequence of the Axiom of choice) that there exists, in a ZFC-universe, a function 
which maps w; one-to-one onto R. But we have no way of determining what such a 
function is. “Most” mathematicians are comfortable with the idea of manipulating a 
mathematical object whose existence has been proven even if they believe that this 
same mathematical object can never actually be constructed or witnessed. For a few, 
however, to put a non-constructible mathematical object whose existence has been 
proven, on the same level as a mathematical object which can actually be constructed 
is nonsense. This, in a nutshell, describes the controversy surrounding the use of the 
Axiom of choice. We will try to develop a deeper understanding of what this axiom 
is about. The Axiom of choice will be seen to be equivalent to other mathematical 
statements which many find more palatable. 


The Axiom of choice 


The Axiom of choice plays a fundamental role in the study of mathematics. This 
principle was invoked in the proofs of numerous mathematical statements long before 
mathematicians began acknowledging it as an axiom. It is worth restating it here: 


The Axiom of choice: Let Z be a set of nonempty sets whose union is the 
set, T, of elements. Then we can choose some element s from each set S 
in Z. That is, there exists a function, f : Z — T, with domain, Z, such 
that for each set, S E€ Z, f(S) ES. 


The function f : Z — T described in this paragraph is referred to as a “choice func- 
tion”. 


Given a set of non-empty sets, Y = {Sa : & E y E Ê}, we do not always have to 
invoke the Axiom of choice if we wish to select an element s from each set S E€ Y. 
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We consider the following example in which we wish to select a single element from a 
single non-empty set. 


Existence of a choice function for a single non-empty set. Suppose we are given a 
non-empty set S for which no specific element of S can be distinguished from the 
others. Suppose we wish to choose some element in the set S. If we argue as follows, 
it is not necessary to invoke the Axiom of choice: 


SØ => P(S) 
=> P(S) x SAO (By definition of Cartesian product.) 
=> {S} x AØ (Since se Ms).) 
= there exists an element (S, x) € {9} x S 


Observe that {(S, x)} is a function with domain {5} and range {x} which associates 
to S some element x of S. We did not invoke the Axiom of choice to postulate the 
existence of the function f(S) = z. 


Selecting an element from each set of non-empty sets from P(N). Consider the set 
P(N)* = A(N) — {Ø} of all non-empty subsets of N. Suppose we want to form a new 
set, S = {n4 : A E Y(N)*}, by selecting from each set, A, in Y(N)* a single element, 
na. We can argue as follows: Since the set N has been shown to be well-ordered, we 
can choose from each non-empty set A € Y(N)* the unique smallest number, n4, in 
A. The Axiom of choice is not required, since each element A in the sets of A(N)* is 
specifically and unambiguously identified as being the unique least element in A. We 
can express our choice function, f : A(N)* — N as follows: 


f ={(A, na) € A(N)* x N: n4 = unique least element of A } 


In this case, the Axiom of choice is not required. 


Suppose, on the other hand, that we are given an infinite set of identical golf balls 
distributed in infinitely many boxes, % = {Aa : a € y}, indexed with the ordinal 
numbers in such a way that no box, Ag, is empty. We are asked to construct a 
function, f : Æ — U{ Aa : a € y}, which chooses a single ball from each box. Note 
that the golf balls are neither labeled nor indexed, in the sense that they are all 
identical in all respects. So there can be no formula for a function f which globally 
states which specific ball is to be chosen in each box. In this case, the best we can do 
is to invoke the Axiom of choice which guarantees that at least one choice function, 
f: A — U{Ag: a € y}, exists. 

It was shown by Kurt Gödel in 1938, that no contradictions can result from invoking 
the Axiom of choice. In 1963, Paul Cohen showed that the Axiom of choice cannot 
be proved from ZF. So the Axiom of choice adds new sets (since functions are sets) 
to a ZF-universe of sets. 
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The following more general theorem guarantees that the existence of a choice function 
for finitely many sets does not require the intervention of the Axiom of choice. 


Theorem 30.1 Suppose Z is a finite set of non-empty sets whose union is the set M. 
Then there exists a function f : S — M which maps each set to one of its elements. 


Proof: We prove this by induction on n € N. 


Let P(n) be the statement: “A system of n sets has a choice function.” 

Base case: The statement P(0) is vacuously true. 

Inductive hypothesis: Suppose the statement P(n) holds true. 

Suppose Z is a system of n + 1 non-empty sets whose union is the set M. Let U € JZ. 
Then Z — {U} is a system of n sets. By the inductive hypothesis, a choice function 
g: Z —{U} > M exists. 

Since U is non-empty, there exists an element, say (U,u) € {U} x U. Define the choice 
function f : Z — M as follows: 


(T) if TES -{U} 
fny={ 8 if T=U 


So P(n + 1) holds true. By mathematical induction, every finite system of sets has a 
choice function. 


A word of caution. The proof above does not show that any countably infinite set 
of sets has a choice function. The statement P(n) states that “a set of n sets has a 
choice function” no matter what the value of n is. It only proves that all finite sets 
have a choice function, nothing more. 


30.3 The Axiom of countable choice. 


The Axiom of countable choice is a weaker form of the Axiom of choice. We state it 
formally: 


The Axiom of countable choice. Let Z be a countable set of nonempty sets 
whose union is the set, T, of elements. Then we can choose some element 
s from each set S in Z. That is, there exists a function f : Z — T with 
domain Z such that for each set S € Z, f(S) € S. 


Since the Axiom of countable choice is a special case of the Axiom of choice, it obvi- 
ously follows from it. If T is countable, then it is well-orderable, and so the Axiom 
of countable choice is not required to justify the existence of a choice function. If 
T is uncountable and we do not assume the Axiom of choice, then T may not be 
well-orderable. In this case the Axiom of countable choice is required to justify the 
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existence of a choice function on Z = {Sp : n € N}! 


The reader may wonder why we cannot generalize the arguments used to prove the 
existence of a choice function on finite sets of sets to justify the existence of a choice 
function on an infinite number of subsets of an infinite set S. The main reason is that 
if there are are infinitely many non-empty subsets F = {Sa : œa € y} of an infinite set 
S, then, for each set Sg E€ F, we would have to invoke the existence principle, 


{Sa} x Sa Æ Ø = there exists (Sa, S) € {Sa} x Sa 


This means that this existence principle would have to be invoked infinitely many 
times. The ZF-axioms do not define a formula with an infinite chain of existence 


symbols. 


4 Equivalent forms of the Axiom of choice. 


There are many mathematical statements which are equivalent to the Axiom of choice. 
One of the simplest statements is the following one. 


Theorem 30.2 Let [AC*] denote the statement: 


Th 


Pr 


“For any set, S = {Sa : a E y}, of non-empty sets, HacySa is non-empty.” 


e Axiom of choice holds true if and only if [AC*] holds true. 


oof: 

(=) Suppose the Axiom of choice holds true. Let Z = {Sa : a E y} be a set of non- 
empty sets. Then there exists a choice function f : S —> UgeySq such that f(Sa) = Sa 
where sq is some element in Sa. Then (sq : a € y) E€ HaeySa. Hence, Nae Sq is non- 
empty. Then [AC*] holds true. 


(<) Suppose the statement [AC*] holds true. Let Z = {Sa : a E y} be a set of non- 
empty sets. Since IIyeySq is non-empty, it contains some element (sq : a € y) E HaeySa- 
Then, for each a € Y, Sa € Sq. The function, f : S —> UacySa, defined as, f(Sa) = Sa, 
is well-defined. Then the Axiom of choice holds true. 


We will now prove that the statement “Every set is well-orderable” and the Axiom of 
choice are equivalent statements. 


'Paul Cohen has also proven that the Axiom of countable choice cannot be proven from the ZF-axioms. 
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Theorem 30.3 The statement “Every set is well-orderable” holds true if and only if the 
Axiom of choice holds true. 


Proof: 


(<) That the Axiom of choice implies “Every set is well-orderable” is proven in Well- 
ordering theorem 29.4. 


(=) What we are given: That T is a well-orderable set and Z is a class of non-empty 
subsets of T. 

What we are required to show: There exists a function f : S — T which maps each set 
Sin Z to some element s € S. 

Since T is well-orderable there exists a function f mapping some ordinal a € Ê one-to- 
one onto T. For any S € Z, f~'[S] is a non-empty subset of œ and so must have a least 
element a* (since a is well-ordered). Then f(a”) is the least element of S with respect to 
the well-ordering induced on T by a. The function g : S — T defined as g(S) = f(a*) 
is a well-defined function mapping each set S in to one of its elements, as required. 


30.5 Some consequences of the Axiom of choice. 


The Axiom of choice has already been invoked a few times in the earlier sections to 
prove important statements which are unprovable without it. There are many such 
statements, intuitively felt to be true, which cannot be proven from the ZF-axioms. 
That is, the Axiom of choice is the only key we can use to unlock certain “doors” 
which would remain closed otherwise. A simple example is the following statement: 


If f: S — T is a function mapping S onto T, then there exists a function 
g:T — S mapping T into S such that fog = Ir, the identity map on T. 


This statement seems obviously true. It is trivially true if the function f is one-to-one 
and onto. If f is not one-to-one, at first glance the proof appears to be straightfor- 
ward. It would go something like this: 


- Since f is onto T, for each y € T, fT [{y}] is non-empty. 

- For each y € T, choose uy E€ fT Hy}. 

- Define the function g : T — S as follows: g(y) = uy. 

- Then (fog)(y) = f(g(y)) = f(uy) = y. So fog is the identity function on T. 


A minor flaw in this proof is that the statement “For each y € T, choose uy € f~ [{y}]” 
is not appropriately justified. The Axiom of choice must be invoked to justify the 
choice of an element in each set of an infinite number of sets. 
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We state a few other well-known statements whose proofs depend on the Axiom of 
choice. That is, the following results hold true only if we accept the Axiom of choice 
as an axiom along with the other ZF-axioms. 


“Every infinite set contains a one-to-one image of the natural numbers.” 
The proof is given in theorem 18.9. 


“Any infinite set can be expressed as the union of a pairwise disjoint set of 
infinite countable sets.” 


The proof is provided in the theorem below. 


Theorem 30.4 [AC] Any infinite set can be expressed as the union of a pairwise disjoint 
set of infinite countable sets. 


Proof: 


What we are given: That S is an infinite set. 
What we are required to show: That there exists a set F = {Ua : a E€ $} of countably 
infinite pairwise disjoint sets such that S = U{U, : a E€ ¢}. 


We will recursively construct the set F: 


— Let 7 = {F € P(S): F is countably infinite}. Since S is a set, then A(S) is a set 


and so J is a set. Since S is infinite, J must contain at least one countably infinite 
subset Uo of S (By theorem 18.9). 

If S — Up is finite, then S is countably infinite and we can let F = {S}; we are done. 
More generally: Let Ky = {Ua : a € y} be a set of countably infinite pairwise disjoint 
subsets of S indexed with the elements of the ordinal y. Either S — U{U, : a € y} is 
finite or it is infinite. 

- If S— U{Ua : a € y} is infinite, then choose an arbitrary element Uy of 7 
which is entirely contained in S — U{U, : a € y}. We then obtain the set 
Ky41 = {Ua : & E€ y+1} of countably infinite pairwise disjoint subsets of S. The 
Axiom of choice will allow us to make the selection of U, for all such sets K} of 
countably infinite sets. 

Let F = {Ua : a E ¢} be the set of all countably infinite sets obtained in this 
way. Then ¥ is a set of pairwise disjoint countably infinite subsets of S. Now either 
U{Ua : a € >} is equal to S or it is not. If it is equal to S, then we are done. If 
it is not equal to S, then S — U{U, : a € ¢} is finite. In such a case we can throw 
those last few elements in Ua for some a € ¢. We then obtain S = U{U, : a E€ ¢} as 
required. 
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30.6 Zorn’s lemma 


Zorn’s lemma is one of the most commonly used statements which is equivalent to 
the Axiom of choice. It refers to a specific property of a partially ordered set. Recall 
that a chain of a partially ordered set X is a linearly ordered subset of X. A maximal 
element of the partially ordered set is an element m of X for which there does not 
exist an element x of X such that m < x. Zorn’s lemma states: 


“Tf every chain of a partially ordered set (X,<) has an upper bound, then 
X has a maximal element.” 


We will first prove that the Axiom of choice implies Zorn’s lemma holds true. This 
will be followed by the proof of its converse. 


Theorem 30.5 [AC] Let (X,<) be a partially ordered set. If every chain of X has an 
upper bound, then X has a maximal element. 


Proof: 


What we are given: That X is a partially ordered set. For every chain C in X, there is 
an element kc in X such that c < ko for all c € C. 

What we are required to show: That X contains an element m such that no element x 
of X satisfies the property m < a. 

Let o be the Hartogs number of X. That is, ¢ is the least ordinal number which is not 
equipotent with any subset of X. 

Proof by contradiction. Suppose X has no maximal element. Then, for every element 
s E€ X, the set s* = {x : x > s} is non-empty. The Axiom of choice guarantees the 
existence of a choice function f : A(X) — X which maps the set s* to some element 
f(s*) € s*. 


We recursively define the function g : 6 — X as follows: 


g0) = zm = f(X) 
g1) = zı = f(x5) > 20 
g(2) = z2 = f(c) ee 
glat) = tar = f(T) > Ta 
If A = limit ordinal, g(A) = TA = upper bound of the chain {za :Q E A} 


In the case where À is a limit ordinal, the hypothesis guarantees that the upper bound 
of the chain {£a : a € A} exists in X. Since the function g is strictly increasing it is 
one-to-one. Since X has no maximal element, the function g maps ¢ = {a : a € ¢} 
one-to-one into X, contradicting the fact that X’s Hartog number ¢ is the least ordinal 
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which cannot be mapped one-to-one into X. The source of the contradiction is our 
assumption that X has no maximal element. We must conclude that X has a maximal 
element, as required. 


Theorem 30.6 [ZL] Suppose that those partially ordered sets (X, <) in which every chain 
has an upper bound must have a maximal element. Then, given any subset Z C A(S)—@, 
there exists a choice function f : Z — S which maps each set in Z to one of its elements. 


Proof: 


What we are given: That partially ordered sets (X, <) in which every chain has an upper 
bound have a maximal element. That Z C Y(S)-—@. 
What we are required to show: That there exists a choice function f : Z — S which 
maps each set in Z to one of its elements. 
Let 

F ={f: 2 — S: fisa choice function with domain J Cc S} 


Note that F is non-empty since it has been shown that all finite sets of sets have a 
choice function. We will partially order the functions in F by inclusion “C”. That is, 


f cg HKS, sE Z xS: f(S)=s) c {(S,s)e Xx S:9(8)=s}] 


Let C be a chain in (¥,C). Then U{f : f € C} is an upper bound of C which is 
contained in F. Then every chain in the partially ordered set (F,C) has an upper 
bound. By hypothesis, (4, C) has a maximal element, say h. 

We claim that h: Z — S is a choice function on J 

Suppose not. That is, suppose there exists a non-empty element S* € Z which does 
not belong to the domain of h. Let h* = h U {(S*,s*)}. Since S* is non-empty, then 
there exists an element s* in S*. Then h C h* € F. This is a contradiction, since h 
was declared to be a maximal element of F. We conclude that h : Z — S is a choice 
function for Z. 

Hence, the Axiom of choice follows from Zorn’s lemma. 


30.7 A consequence of Zorn’s lemma 


We now provide a proof of a statement normally encountered in a course of linear al- 
gebra. In first year courses, this statement is often stated without proof. Postulating 
this statement is easier than postulating the more abstract Zorn’s lemma. We begin 
by recalling a few facts. 


A basis B of a vector space V is a non-empty subset of V with special properties. 
This set B need not be finite. But it must satisfy two properties: 
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1) Every finite linear combination of elements in B which equals zero must have only 
zeroes as coefficients. This is called the linear independence property. 

2) Every vector in V is a linear combination of finitely many elements of B. In such 
cases we say that “B spans V”. 


There are algorithms we can use to find a basis for a finite dimensional vector space. 
But some vector spaces do not have a finite basis. For example, the vector space of all 
countably infinite sequences of real numbers, (a1, a2, a@3,..., ) does not have a finite 
basis. But if we assume Zorn’s lemma, we can prove that it has a basis. But this 
proof does not show how to construct it or produce an explicit basis for vector spaces 
which have no finite spanning family. In fact, for this vector space, no basis can be 
found. We prove below that a basis exists for any vector space. The acronym [ZL] 
next to the theorem statement informs the reader that the proof invokes Zorn’s lemma. 


Theorem 30.7 [ZL] Every vector space has a basis. 


Proof: 


Let V be a vector space and let (.¥, Cg) be the set of all linearly independent subsets 

of the vector space V ordered by inclusion Cg. The set ¥ is non-empty since non-zero 

singleton sets are linearly independent. 

Let @ be a chain of linearly independent subsets in ¥. 

We claim that the union Uge¢C is also linearly independent: 

It suffices to show that every finite linear combination of elements of UcegC which 

equals zero must have zeroes as coefficients. Let U = {v1, v2, v2,..-,Un} be a set of 

vectors in Uge¢C. 

- Then U C C for some C € @ (since C is a chain of subsets). 

- Since Č is linearly independent, then œv + agv2 + a3v3 +--+: + AnVn = 0 implies 
ay = Q2 =: = Qn = 0. 








So UcegC is linearly independent as claimed. 


Then every chain @ in (F,Cg) has an upper bound in F. 

By Zorn’s lemma, (.¥, Cg) has a maximal linearly independent set B*. That is, B* is a 
linearly independent set that is not a subset of any other linearly independent set. We 
now show that B* spans V. Ifv € V — B* is not a linear combination of vectors in 
B*, then B* U {v} is a linearly independent subset of V which properly contains B*, 
contradicting the maximality of B*. So B* spans V. So B* is a basis of V. Thus, every 
vector space has a basis. 


Concepts review: 


1. What does the Well-ordering principle say? 
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2. What does the Axiom of choice say? 
3. Is the Axiom of choice required to justify the existence of a choice function for finite 
sets? 
4. Is an axiom required to justify the existence of a choice function for countably infinite 
subsets of P(S)? 
5. What does the Axiom of countable choice say? 
6. Provide an example of a statement whose proof requires the Axiom of choice. 
7. Does the Axiom of countable choice follow from the Z F-axioms? 
8. State Zorn’s lemma. 
9. What linear algebra statement can be proved by invoking Zorn’s lemma. 
EXERCISES 


1. For which of the following sets of sets is the Axiom of choice required to guarantee 
the existence of a function which selects an element from each set: 


a) An infinite set of sets .Y where each set in Z contains one element. 


o 


Three sets each containing all elements of R. 


Q 


A countably infinite number of sets each containing all the rational numbers. 


aA 


Uncountably many sets each containing three identical golf balls. 


) 
) 
) 
) A countably infinite number of sets each containing uncountably many golf balls 
and one marble. 


e 


f) An uncountably infinite number of sets each containing a pair of socks. 
g) An uncountably infinite number of sets each containing two blue marbles and 
one red one. 


2. Let U and V be non-empty sets. Suppose R is a relation in U x V with domain 
T CU. Prove that there exists a function f :T — V such that f C R. 

3. Let U and V be non-empty sets. Prove that a function f : U — V maps U onto V 
if and only if there exists some function h : V — U such that foh is the identity 
function on U. 


4. Let S be a non-empty set. Suppose U is a non-empty subset of the set Z (S) partially 


ordered by “C”. Prove that lub U = Uzeuyz. 
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C. 5. If S is a set containing more than one element, show that there exists a one-to-one 
function f : S — S such that f maps no point x in S to itself. That is, f(x) # x for 
all x in S. 


6. Let Z be a set of sets. Let M = {U € P(.Y): X,Y c U implies XN Y # Ø}. 
Show that .@ contains a maximal element T with respect to the ordering “C”. That 
is, T € @ and, for any BE Z-T, TU{B} g M. 
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31 / Regularity and the cumulative hierarchy 


Summary. In this section we state the Axiom of regularity and present some 
of its equivalent forms. We prove that the Axiom of regularity is equivalent to 
the statement “Every set has an €-minimal element”. We also show that in the 
presence of the Axiom of regularity, no set can be an element of itself. We define 
“well-founded sets” and show that in the presence of the Axiom of choice, the 
Axiom of regularity is equivalent to the statement “Every set is well-founded”. 
The transitive closure of a set is defined. Finally we show how to construct in 
stages, starting with the empty set, a class of sets whose union, V, contains all 
sets. The class, V, is referred to as the “Von Neumann’s universe”. The Axiom 
of regularity is used to show that this class, V, indeed contains all sets. We then 
define the “cumulative hierarchy” and the “rank of a set”. 


31.1 Equivalent forms of the Axiom of regularity 


The Axiom of regularity is the only ZFC axiom we have not yet invoked to justify 
any steps in the proofs of theorems presented up to now. We restate this axiom: 


Axiom of regularity: Every non-empty set S has an element x such that 
TASSA: 


To help us better understand the meaning of the Axiom of regularity — also known 
as the Axiom of Foundation — we investigate some of its equivalent forms and conse- 
quences. In what follows, we will say that m is a minimal element of the set S with 
respect to the order relation “<” if S is ordered by “<” and S contains no element x 
such z < m. 


Theorem 31.1 The Axiom of regularity holds true if and only if every non-empty set S$ 
contains a minimal element with respect to the membership relation “e”. 


Proof: 


(=) 

Given: For any non-empty set S, there exists an element m such that mN S = Ø. 
Required to show: That every non-empty set S contains a minimal element with respect 
to the membership relation “Ee”. 

Let S be a non-empty set and m be an element of S such that mN S = Ø. Suppose m is 
not a minimal element of S. That is, suppose S contains an element x such that x € m. 
Then z E€ mN S ¥ Ø, contradicting our hypothesis. Then this element m is minimal in 
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S with respect to “€”, as claimed. 

(=) 

Given: That every non-empty set S contains a minimal element m with respect to “c”. 
Required to show: That S contains some element m such that mN S = Ø. 

Suppose S$ is a set such that for every m € S, there exists x E€ mM S 4 Ø; then no 
element m in S is minimal with respect to €. This contradicts our hypothesis. Hence, 
for every set S there exists m such m N S = Ø. 


Up to this point in our study of sets, we have not encountered a set x such that 
x E€ x. We thought it would be best to stay away from such “creatures”, at least 
until we better understand the difficulties that they may cause. We will now see that 
the statement “No set can be an element of itself” is a consequence of the Axiom of 
regularity. 


Theorem 31.2 [Axiom of regularity] No set is an element of itself. 


Proof: 


Let x be an element and S = {x}. Suppose x € x. Then, x € £ N S. Then, for every 
element, x, of S, x E€ £N S Æ Ø, This contradicts the Axiom of regularity. Then, for 
any set x, £ É x. 


So the ZF'C-set theoretic universe contains no set which is an element of itself. Recall 
that in the formal definition of “ordinal numbers”, we required ordinal numbers to 
be strictly €-well-ordered. Now we see that defining ordinals as being simply €-well- 
ordered would have been sufficient, in the sense that the strictly ordered property 
would follow from “regularity”. Proceeding as we did allows us to see that ordinals 
exist as sets even in the absence of the Axiom of regularity. 


Definition 31.3 We say that a class © is well-founded if S does not contain an infinite 
descending chain of sets. That is, there does not exist an infinite sequence {£n : n E€ w} 
such that ---€ £4 € £3 € £2 € £1 € T0. 





The set-theoretic universe we have explored up to now was not assumed to be a 
well-founded universe. Sets which are not well-founded were simply not considered 
or raised as a subject for discussion. Attempting to prove that non-well-founded sets 
exist, or do not exist, was more or less viewed as a digression from the concepts we 
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were studying at that time. 


We will now show that in the presence of the Axiom of choice,! the two statements 
“Every set is well-founded” and the Axiom of regularity are equivalent. This means 
that in the absence of the Axiom of regularity, we would have to accept that non-well- 
founded sets may exist and study what impact the existence of such sets has in our 


set-theoretic universe.” 


Theorem 31.4 [AC] The Aziom of regularity and the statement “Every set is well-founded” 
are equivalent statements. 


Proof: 


(=) 

Given: The Axiom of regularity holds true. 

Required to show: That all sets are well-founded. 

Suppose there exists a set which contains an infinite descending chain of sets S = {£n : 
n E€ w}. This means that an €-ordered chain such as --- € £4 € £3 E€ £2 E€ £1 E Tọ exists. 
By hypothesis, S must contain some element m such that m N S = @. This element m 
must be equal to x, for some k € w. Since £tk}1 € MN S, we have a contradiction. So 
non-well-founded sets cannot exist in the presence of regularity. 

(<) 

Given: That every set is well-founded. 

Required to show: That every non-empty set S contains an element m such that 
mN S =f. 

Suppose there exists a non-empty set S such that for every m € S, mN S Æ Ø. Then 
there exists a relation R C S x S such that (m, x) € R if and only if x£ € mN S. Since 
mN S 4 Ø for all m € S, the domain of R is all of S. Invoking the Axiom of choice, 
there exists a “choice function” f : S — S, f C R, where, for each m € S, f(m) E MmAS. 
Let xo = f (S). We recursively define a function g : wọ —> S as follows: 





g(0) = zo = f(S) 

g(1) = 2, = f(t) €a0NS 
g(2) = ¢ = f(m)erns 
g(n+1) = tay = flen) ENS 


1The theorem actually uses a weak form of the Axiom of choice, called the Axiom of dependent choice 
?It was shown in 1929 by von Neumann that if “ZF without regularity” is consistent, then “ZF with 
regularity” is also consistent. 
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We have thus constructed a set {£n : n € wo} where, for each n € wo, £n41 € Ln. The 
set {£n : n E wo} is an infinite descending chain. Its existence contradicts the hypothesis 
stating that S is a well-founded set, a set in which no such chain can exist. The source 
of our contradiction is our supposition that S does not contain a minimal element m. 
Hence, in presence of the Axiom of choice, the Axiom of regularity follows from the 
statement “Every set is well-founded”. 


31.2 Transitive closure of a set 


Before we discuss another characterization of the Axiom of regularity, we introduce 
what is known as the transitive closure of a set. Recall that a set S is a transitive set 
if whenever y € Sand x € y, then x € S. 


Definition 31.5 Let x be a set. The transitive closure of x is a set ty satisfying the fol- 
lowing three properties: 


1) The set t, is a transitive set. 
2) Ct 


3) tz is the C-least transitive set satisfying properties 1 and 2. 


Example — Consider the set S = {2}. We see that the set S is not transitive, since 
1 € 2 = {0,1}, 2 € S, but 1 Z S. So S does not contain all the elements required 
for it to be transitive. Starting with S, we will construct, step by step, its transitive 
closure, ts. We have seen that the element 1 is missing, so let’s add it to S: Let 
Sı = {1,2}. We see that Sı is not transitive since since 0 € 1 = {0}, 1 € Sj, but 
0 Z S. We then add to S1, the element 0: Let S2 = {0,1,2}. We see that S2 is the 
natural number 3 known to be transitive. Then, the transitive closure, ts, of S = {2} 
is the natural number 3 = {0, 1, 2}. 


Completing a non-transitive set, S, to its transitive closure, tg, means to add to S all 
the elements which belong to elements of the set. 


The following theorem guarantees that every non-empty set, x, has a transitive clo- 
sure, ty. 


Theorem 31.6 Let x be a set. Then there exists a smallest transitive set, tz, which con- 
tains all elements of x. That is, every set, x, has a transitive closure, tz. 


Proof: 
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Let Z denote the class of all sets. Let f : Z — Z be a function defined as: f(u) = 
Ufy E LZ: y E€ u}. Let rp € F. We recursively define the function g : wọ > F as 
follows: 


g(0) = to E F 

g(1) = 2 = f(x)=Ulye %:y € xo} 
g(2) =a = f(m)=Ulye%: yer} 
g(n) d Tn = len) = U{y EF:YE Ln—1} 
g(m+1) = try = flan)=Ufye Sry € In} 


Let 

tro = U{ Tn : N E wo} = to UTI UT2U +s 
Since xo is a set, each £n is the union of a set of sets and so tro is itself a set. 
Claim: That tz, is a transitive set. 
Suppose u € tz, and v € u. We are required to show that v € tz). Since tz, = U{ Tn : 
n E€ uo}, u € £p for some k E€ w. Since xp41 = f (£k) = Ufy E S : y © re}, UC Ekl: 
Hence, v € u C £k41 C U{tn : Nn E wo} = tro. So tro is a transitive set, as claimed. 
Suppose now that s is some transitive set such that xo C s. 
Claim: tro E 8. 
It suffices to show that x, C s for all n. We will show this by induction. 
Base case: That xo C s is given. 
Inductive hypothesis: Suppose x, C s. We are required to show that p41 C s. If 
u E€ Ing = f(an) = Uy E Z : Y E £n}, then u € y, for some y E€ £n C s. Since s is 
transitive, u € y C s implies u € s. Then £n41 C s. 
Hence, by mathematical induction, £n C s, for all n. Since tz, = U{£n : n € w}, try Cs, 
as claimed. 


We have thus constructed the smallest transitive set t;, which contains x9. Then for 
any set x there exists a smallest transitive set ty which contains x. 


31.3 Constructing the class of all sets in stages 
In this section we will construct, by stages, an ordinal-indexed class 
{Va:a€ O} 


of sets, starting with the empty set Vo = Ø. This structured class, ordered by inclu- 
sion “C”, is referred to as the Cumulative hierarchy of sets.! The union, V = U{V, : 


1The terms “rank hierarchy of sets” is sometimes used. 
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a € Ê}, of all the V,-sets is referred to as the Von Neumann universe of sets or the 
Von Neumann hierarchy of sets. The restrictions imposed by the Axiom of regularity 
on the type of sets which belong to the ZFC-universe will guarantee that every set 
which belongs to the ZF'C-universe of sets also belongs to some Va C V. That is, V 
accounts a for all sets whose existence is determined from the ZF'C-axioms. 


We have often used in this text the symbol .Y to denote the “class of all sets” in 
the ZFC-universe. The class V of sets which we will now investigate will also denote 
the class of all sets; but the class V is structured, as we will soon see. It is con- 
structed in stages, starting only with the empty set @ . The construction of the class, 
{Va : a E€ Ê}, is described below. 


Definition 31.7 Define the class function, f : Z => Z, as f(S) = A(S). The elements 
of the class, {Va : a E€ Ê}, belong to the image of the class function g(a) = Va recursively 
defined as follows: 


g0) = VW = @ =i 
g1) =U = fM%) = 2?(0)={Øø}=1 
G2) 2 Vs 3S GV SS PM St ee. aso 
g8) = V3 = fV) = P(2)={ {{o}, of, {{o}}, {2}, } 
g(A) = V4 = f (V3) = P (Vz) (24 = 16 elements) 
glat) = Va = fV) = PV 
If \ = limit ordinal, g(X) = Vy _ Uac Va 


The class {Va : a € Ø} is called the Cumulative hierarchy of sets. The union of all the 
elements of the cumulative hierarchy of sets is denoted as: 


V = Uaco Va 


We verify that every single element, Va of the cumulative hierarchy of sets is indeed 
a set: Since each Va is either the image of a set, the power set of a set, or the union 
of a set of sets, then each Va is a set (by the axioms of replacement, union and power 
set). That is, 

{Va:a€ O} 


is a class of sets. 


346 Section 31: Regularity and the cumulative hierarchy 


Every Va is a transitive set : Let P(a) denote the statement “Va is transitive”. 


Since Vo = Ø, P(0) trivially holds true. 
Suppose P(qa) holds true. That is, suppose Va is transitive. 


y E x E€ Va+ = P (Va) yer Va 


y C Va (Since Va is transitive.) 
y € P (Va) = Vat 

V,+ is transitive 

= P(a*) holds true. 


UUL 


Suppose y is a limit ordinal and that a € y implies P(a) holds true. 


y E€ x E€ Vy = Uac Va => yEex € Vg for some transitive Vg 
> yEeVeCvV, 
= V, is transitive 
= P(y) holds true. 


By transfinite induction, every Va is transitive. 


Up to now, we have always represented the class of all sets which has evolved from 
the ten ZFC-axioms by the symbol, Z = {x : x isaset}. Since every element of 
{Va :a E€ Ê} is a set, we can write, {Va : a E€ O} C SY. Furthermore, 


LEV = Uac Va => LEV,, forsomey¢ 6 
=> x C Vy, Since Vy is transitive. 
=> «x isaset. 
> LTES 


Hence 
V = Uaco Va Œ S 


We now wonder whether Z C U{Va : a E€ @}. That is, are all sets accounted for in 
V. We will prove that this is indeed the case. With this objective in mind we first 
establish the following lemma. 


Lemma 31.8 The class {Va : a € Ê} is a strictly €-increasing (C-increasing) chain of 
transitive sets. That is, if a € 8, then Va € Vg. Hence, Va C Vg. 


Proof: 
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We have proven above that Va is transitive for every a € Ô. 

Let P(y) denote the statement “a € y implies Vy € Vy”. 

Suppose P() holds true all G € y. That is, if a € 8 € y, then Va € Vz. 

Suppose ¢ € y. We are required to show that Vg € Vy. 

— Case 1: If y is a limit ordinal, then Vg € Vg+ C UgeyVa = Vy. 

— Case 2: Suppose y = yt. If d € p, then Vg € Vy, by the inductive hypothesis. If 
p =, then Vg = Vy € A(Vy) = Vyt+ = Vy; hence, P(y) holds true. 

By transfinite induction, a € y implies Va € Vy for all y. Since Vy is transitive, Va C Vy 
for all y 


Lemma 31.9 For any non-empty set B, B C V implies BE V. 


Proof: 
Given: B is a non-empty set such that B C V = UVa :a E€ Ø}. 
Required to show: That B € V. 


Let u € B. Then B C V => u € Va for some qQ € Ô. 
Then the set {a € Ê : u E€ Va} is non-empty. This ensures that the function f : B > @ 
defined as 

f(u) = least{a € Ê : u € Va} 


is well-defined. 
— Since B is a set, by the Axiom of replacement, f|B] is a set of ordinals. Since f[B] is 
a set, 3 = Ugeipjo (a union of a set of sets) is a set. By theorems 27.11 and 27.12, 
@ is an ordinal. 


— Since a C £ for all a € f[B] and £ is transitive, a €- 8, for all a € f[B}.! 

— By lemma 31.8, for every a € f[B], [a €= 8] > [Va € Val. 
Then, for every u € B, u € Vu) C Vg. This implies that B C Vg and so B €= A(Vg) = 
V+ C V, as claimed. 


In the following theorem we refer to the sets in Z, the class of all sets. 


Theorem 31.10 [Axiom of regularity] For every set x, x € V = UgeoVa. That is, Z C 
V = Uaco Va. 


Proof: 


Recall that “E_” is read as “is an element of, or is equal to”. 
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Given: That x is a set and that V = Uae Va- 

Required to show: That x € V. 

If x = Ø, then x € V, we are done. We then suppose x # Ø. 

Claim : The transitive closure, ty, of the set x, is a proper subset of V. 

Suppose t, É V. That is, suppose there is an element u such that u € ty and u ¢ V. 
Then the set U = {u € ty : u ¢ V} is non-empty. Since U is non-empty, by the Axiom 
of regularity, U contains a minimal element m € U. That is, m € U and mNU = Ø. 
Note that m ¢ V. If m = Ø then m E€ V, a contradiction; so m is a non-empty set. 
Suppose y E€ m. Since tz is transitive and m €E tz, then y € tz. But since mM U = Ø, 
y cannot belong to U. Then y € V. We conclude that m C tz V. By the previous 
lemma, [mn C V] = [m € V]. This contradicts the fact that m € U. The source of 
this contradiction is our supposition that U = {u € tz : u Z V} is non-empty. Then 
U = {u € tx : u V} is empty and so t C V, as claimed. 

By the previous lemma, ty € V. Then tz € Va for some a. 


x C ty E€ Va => T C ty G Va (Since Va is a transitive set) 
=> tCVa 
> zE P(Va =V CV 


So x € V, as required. 


Since Y C V and every element of V is a set, then the class V = Uacg Va provides 
us with a description of the set-theoretic universe, %, which evolves from the ten 
ZFC-axioms (nine ZF-axioms plus Choice) listed in chapter one. It is interesting to 
note the class, V = Uge@Va, contains all sets only in the presence of the Axiom of 
regularity. That is, without the restrictions that the Axiom of regularity imposes on 
sets, some sets in Z would not be accounted for in V. We now show that if Y C V, 
then the Axiom of regularity holds true on Z. 


Theorem 31.11 Let V = UgeeVa be the class of sets constructed as described above and 
Z denote the class of all sets. If AY C V, then every set in Y has a €-minimal element. 


Proof: 


What we are given: That the class V contains all sets. 

What we are required to show: That every non-empty set x contains a €-minimal element. 
Suppose x is a non-empty set. By hypothesis, x belongs to V = UgeeVa. Then z € Va 
for some a. Since Va is transitive x C Va C V. We can then define a function f :2—- @ 
as f(u) = least{a € @:u C Va}. Since x is a non-empty set, by the Axiom of replace- 
ment, f|x] is a non-empty set of ordinals and so contains a least element, say ¢. Since @ 
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is in the image of x under f, there exists an element m in the domain x of f such that 
f(m) = ¢. Then ¢ is the least ordinal such that m C Vg; equivalently, it is the least 
ordinal such that m € Vy+. Then m ¢ Vg. 

We claim that m is an €-minimal element of x: Suppose not. Suppose z € mM a. Then 
since z E m C Vg implies z € V4 we have f(z) € f(m) = ¢. This contradicts the fact 
that ¢ is minimal in f|z]. Then m is an €-minimal of x as claimed. 


Then every set has a minimal element with respect to “e”. 


The above theorem shows that, 


For every ordinal a, Va, satisfies the axiom of regularity. 


31.4 The rank of a set. 


The following concept will allow the reader to develop some familiarity with the ele- 
ments of {Va : a € Ê} and how they are defined. 


Given any set U, we will define the rank of the set U, denoted as rank(U), as follows: 


rank(U) = least{a € © : U C Va} 


First note that rank : V — mapping a set, U, to a unique ordinal number, rank(U), 
is a well-defined function. 


Example — We list the sets Vo to V4 to help us determine the rank of a few simple sets. 


g0) = YW = @ = 0 

gl): = Vp = fo) = ?(0)={2}=1 

g2) = VY = fV) = PZ(1)={ {9},o}=2'=2 

g(3) = Vs = fV) = VY2)={ {{2}, 2}, {312} 2 } 
g(4) = Vy = f (Və) = P (V3) (2f = 16 elements) 


— Suppose A = 3. Since 3 = {0,1,2} = { Ø, {Ø}, {18}, Ø} } C V3 and 3 ¢ Va, 
then rank(A) = 3. 

— Suppose B = {{@}}. We see that {{@}} C Vo and {{o}} ¢ Vj; hence, 
rank(B) = 2. 

— Suppose C = { {{{@}}},@}. We see that both {{{@}}} and Ø belong to 
P (Vz) = Va; hence, C C V4. Since {{{2}}} ¢ V3, C É Vz; hence, rank(C) = 4. 


If we know the rank of the elements of a set U it may help determine the rank of the 
set itself. We prove a few useful facts about the rank of various sets. 
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Theorem 31.12 Let V =U{V,: a€ @}. 
a) The rank of the empty set Ø is zero. 
b) If U € Vg, then rank(U) < 8; hence, U ¢ V;anky) for all sets U. 
c) If U is a set then [rank(U) < 6] = [U € Vg]. 
d) If U and V are sets such that U € V, then rank(U) < rank(V). 


e) If y is an ordinal, then rank(y) = y. 


Proof: 


a) Note that if U = Ø, then rank(U) is the least ordinal number a such that Ø C a, 
namely 0. So rank(@) = 0. 

b) Given: U € Vg. 
Required to show: rank(U) < 6 and so, U ¢ Viank(u) for all sets U. 
To show this we consider two cases: 
Case 1: Suppose 6 = a + 1 for some a. Then Vg = Va+ı = Z(Va). Then 
[U € Vg] = [U C Va]. It then follows that the rank(U) < a < a + 1 = 2, as claimed. 
Case 2: Suppose / is a limit ordinal. Since U € Vg = UacgVa, then U € Va for some 
a € B. Since Va is transitive U C V, so again, rank(U) < a < p, as required. 
If U € Viank(U), then, by case 1 and 2, rank(U) < rank(U), a contradiction. Hence, 
for all sets U, U € Vrankcu), as required. 

c) Given: rank(U) < $. 
Required to show: U € Vg. 
See that [rank(U) < 6] = [U C Va], for some a where a+1 < 8. Then U € Va+ı C Vo. 
Then U € Vg, as required. 
So for any ordinal a, Va is precisely the set of all sets U whose rank is less than a. 

d) Given: U and V are sets such that U € V. 
Required to show: rank(U) < rank(V). 


UeVe Vrank(V) > Ue Vrank(V) 
= rank(U) < rank(V) By part b).) 


e) Given: y is an ordinal. 
Required to show: That rank(y) = y. 
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31.5 


Claim: rank(y) < y for all ordinals y. 
Suppose y < rank(y). 


Fy < rank(y) y < b < rank(y) For some ordinal £. 
y€ Bex rank(y) 


Y E€ B G Ve € Vranko) By part b) of lemma 31.8. 


{id yy 


ay € Vanko) Contradicting part b). 


The source of the contradiction is our supposition that y < rank(y). Then rank(y) < y, 
as claimed. 


We now show that rank(y) = y for all ordinals y. We prove this by transfinite math- 
ematical induction. Suppose rank(a) = a for all ordinals a € y. It suffices to show 
that rank(y) = y. Since we have shown that rank(y) < y, it suffices to show that 
rank(y) £ y. Suppose not. That is, suppose rank(y) = 8 < y. Then, by the induc- 
tive hypothesis, rank(3) = 6 = rank(y). Since, 86 < y, @ € y and so, by part d), 
rank(8) < rank(y), contradicting rank(8) = rank(y). Hence, rank(y) < y is impossi- 
ble. Then rank(y) £ y and rank(y) < y implies rank(y) = y. By transfinite induction, 
rank(y) = y for all ordinals y. 


Remark: If a is any ordinal then, by part e) of theorem 31.12, rank(a) = a, there- 
fore œ C Va and a É Vg, for any ordinal 6 € a. This means that the set of all 
natural numbers, wo, is a subset of Voo. Furthermore, we mentioned earlier that 
R C P" (wo) = Vopr (see page 161); then we can safely say that R C Vogtwo- Hence, 
even for relatively small ordinals, a, Vą can contain many of the sets we really care 
about in mathematics. In fact, as we shall soon see, many of the ZFC-axioms hold 
true in some Va where a is relatively small. 


The ZFC-set-axioms ingrained in the V,-sets. 


In theorem 31.11, we have shown that if Z denotes the class of all sets in the ZFC- 
universe, then .Y = V = Uaeg Va. Since all of the ZFC-axioms hold true on Z, then 
they must hold true in reference to the sets in V = UgegVq. In particular, we have 
seen that if we exclude the Axiom of regularity from the list of the ZFC-axioms, then 
Uac Va is a proper subclass of Z. 


We now investigate which of the V,-sets satisfy each of the ZFC-axioms. This will 
provide us an opportunity to analyze and better understand what each of the ZFC- 
axioms mean. If Axiom A holds true in Va then we say that “Va is a model of Axiom 
A”. 

Axiom of pair. Recall that the Axiom of pair states that if a and b are sets, then 
{a,b} is also a set. To show that a set K satisfies this axiom we must prove that 
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abe k= {a,b} E K. 


Proposition 31.13 Let Ø be a limit ordinal and a and b be two elements of Vg. Then 
{a, b} is an element of Vg. That is, Vg satisfies the property described by the Axiom of pair. 


Proof: 


We are given that a and b are two elements of Vg where ( is a limit ordinal. 

We are required to show that {a,b} € Vg. 

Since a,b € Vg = UacgVa, then there exists some y € 8 such that {a,b} C Vy C Vz. 
Then {a,b} E A(V,) = Vy41 C Vg. Then {a,b} € Ve. 


So if @ is a limit ordinal, Vg is a model of the Axiom of pair. 


Axiom of union. The Axiom of union states that if A = {U : U € A} isa set of sets, 
then the union D = U{U : U € A} isa set. 


Proposition 31.14 Let 8 be any ordinal. If U is an element of Vg, then U{x : x € U} € Vg. 
That is, Vg satisfies the property described by the Axiom of union. 


Proof: 


Given: That U € Vg. 
Required to show: That U{a: x € U} € Vz. 


Since U € Vg, then, by property b) in theorem 31.12, rank(U) < £. 
Let y E€ U{x : x € U}. Then y is an element of some x € U. Since y € x € U, then 
rank(y) < rank(x) < rank(U) (by part c) of theorem 31.12 above). By part b) above, 


[rank(y) < rank(U)] and [rank(U) < 6] | = [y © Viankiuy C Vol 


So y € Vg. We deduce that U{x : £ € U} C Vank) C Ve. This means that 
U{z: x € U} € P(Vianku)) E Va. It follows that U{x : x € U} € Vg, as required. 


So every Va satisfies the property described in the Axiom of union. 


So, for any ordinal 8, Vg is a model of the Axiom of union. 


Axiom of power set. We now prove that for any limit ordinal 3, Vg satisfies the prop- 
erty described by the Axiom of power set. To show that a set K satisfies this property 
we must show that if U € K, then AU) € K. 
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Proposition 31.15 Let 3 be a limit ordinal. If U is an element of Vg, then Vg also contains 
aset Y = A(U) such that S C U implies S € Y. That is, Vg satisfies the property described 
by the Axiom of power set. 


Proof: 


Given: That U € Vg. Then by part b) of theorem 31.12, rank(U) < £. 
Required to show: That A(U) € Vg. 

Since U € Vg, then, by property b) in theorem 31.12, rank(U) < £. 
For any ACU, ACU C Vank). Then, for all subsets A of U, 


AE P (Vrank(u)) = Vrank(U)-+1 


Then AU) C Viankcy)41, and so P(U) E€ P(Viank(U)+1) = Vrank(y)+2- Since 8 is a 
limit ordinal, and rank(U) < 8, then rank(U) +1 < rank(U) +2 < 8. Then we obtain 


PU) € P (Vrank(U)-+1) = Vrank(U)-+2 Cc Ve 


We conclude that A(U) € Vg, as required. 
So, for every limit ordinal Ø, Vg satisfies the property described by the Axiom of power 
set. 


We have just shown that if @ is limit ordinal, Vg is a model of the Axiom of power 
set. 


Axiom of extent. We begin with a brief discussion of the Axiom of extent. We remind 
ourselves about what the Axiom of extent states: Given a pair of sets A and B, if for 
all elements x, (x € A & x € B) holds true, then A = B. If we want to verify whether 
a set K satisfies the Axiom of extent we must show that: Given any pair of elements 
x and y in K, if for all z € K(z € x & z E€ y), then z and y are the same set. 


We first provide an example of a set in which the Axiom of extent does not hold true. 
Suppose 
K= {{b, C, d}, {a, b, d}, {b, c}, b, d} 


For the pair {b, c, d}, {a, b, d} in K, and elements, b, d € K, we see that the conditions 
be {a,b,d} = be {b,c,d} 
dé {a,b,d} = de {b,c,d} 


hold true in K, but in spite of this {a,b,d} 4 {b,c,d}. Although K “recognizes” 
b and d as its elements it does not “recognize” the elements a and c (meaning that 
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a,c ¢ K). So the Axiom of extent does not hold true in the set K. 


We now consider the set 
H={2, {2}, {{o}, g} HØM {22} } 


Now H satisfies the Axiom of extent since, 


ØE {0,1} = we {{o}, oa} 
{2} E{S,{S}} © {S} € {{P}, 2} 


and the set H “recognizes” all the elements of both sets {@,{@}} and {{@}, Ø}, as 
elements of itself and so is in a position to confirm that {@,{@}} and {{@}, Ø} are 
indeed the same set. This is something that the set K in the previous example could 
not do. The reason that the Axiom of extent is satisfied in H is that H is a transitive 
set. The transitive property of the set K allows it to recognize all the elements of its 
sets and confirm which sets are equal and which sets are not. With these thoughts in 
mind we confirm that every set Va satisfies the Axiom of extent. 


Proposition 31.16 Let aœ be any ordinal. For any two elements x and y of Va, if for all 
z E€ Valz E £x & z E y), then z and y are the same set. 
Proof: 


Given: That for any two elements x and y of Va, Vz E€ Valz E £ & z €y). 
Required to show: That x and y are the same set. 


Suppose x and y are not the same set. Then, without loss of generality there is some 
element u € y— (that is, u € y but u ¢ x). Since Va is a transitive set, then 
[u € y € Va] => [u € Va]. So we have, “u € y and u ¢ x”, for some u € Va. But 
this contradicts our hypothesis “for all z € Va(z € « & z € y)”; that is, for u € Va, 
(uE y)=> (wen). 

So, for every ordinal a, Va satisfies the property described by the Axiom of extent. 


We now see that for any ordinal 3, Vg is a model of the Axiom of extent. 


Axiom of infinity. The Axiom of infinity states that there exists a non-empty set A 
(called an inductive set) that satisfies the condition: “x € A” > “xU {xr} € A”. The 
set, Wo, was defined to be the smallest inductive set. To show that a set K satisfies 
the Axiom of infinity we need to show that wo € K. 


Proposition 31.17 Let œ be an ordinal number such that œa > wo. Then wo € Va. 
Proof: 
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Given: That a be an ordinal number such that a > wọ. 

Required to show: That wo € Vo. 

First note that Vpn is finite for all n € wọ. This is easily verified by a proof by mathemati- 
cal induction on wo. Since Voo = U{V, : n € wo} and wo is infinite, wo E€ Vo is impossible. 
Since wo is an ordinal, rank(wo) = wo (by theorem 31.12). Hence, the least ordinal a 
such that wo C Va is wo. Then wo C Vus. This implies wo € P (Voo) = Vay+i © Vy, for 
all y > wo. 

We conclude that wo € Va for all œ such that œ > wo, as required. 


We have shown that for any ordinal 3 > wo, Vg is a model of the Axiom of infinity. 


Axiom of subsets. The Axiom of subsets states that if S is a given set and ¢ is a for- 
mula expressed in the language of set theory, then the class of all elements x of S for 
which ¢(«) holds true is a set. To show that the Axiom of subsets holds true in a set K, 
we must show that if M is a class whose elements belong to a set S € K, then M € K. 


Proposition 31.18 Let œ be an ordinal number. Then the Axiom of subsets holds true in 
Va: 


Proof: 
Given: That a is any ordinal, that S € Va and M = {x E€ S : ¢(a)} CS. 
Required to show: That M € Va. 


Since S € Va, then rank(S) < a (by theorem 31.12). Since M = {x € S : d(x)} CS, 
then rank(M) < rank(S) < a (again by theorem 31.12). Given that rank(M) < a, then 
Vianx(m) C Va- It follows that 


MC Viank(M) E P (V ankim) ) = Vank(m) +1 C Va 


Since M is a subset of Viankcar), then M € A(Viankcary)) C Va. We conclude that M € Va, 
as required. 


We now see that for any ordinal 3, Vg is a model of the Axiom of subsets. 


Axiom of replacement. The Axiom of replacement can be expressed as follows: Let 
w be a set. If ¢ is a formula so that for every x € w, [¢(x, z) and ¢(z, y)] > ly = 2], 
then the class {u : (x, u) for some u € w} is a set. 


To show that a set K satisfies this axiom we must show that if w € K and disa 
formula so that for every x € w, [b(x, z) and (x, y)| > ly = z], then {u : d(x, u) for 
some u E€ w} € K. 
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We first provide an example of a transitive set in which the Axiom of replacement does 
not hold true. Consider the set Vobis We first note that wo © Veank(wp) = Vuo C Vuowo; 
hence, wo C Voowo: Define the function f : wo > wowo as follows: f(n) = won. Then 
flwo] = {won : n E€ wo} = wowo. 

We claim that f[wo]  Voowo: Since rank(wow9) = wowo, then wowo is the least ordinal 
a such that f[wo] = wowo C Va. Then wowo + 1 is the the least ordinal such that 
flwo] = wowo E Alwowo) = Vawot1- Then flwo] = wowo E Vugwo- So the images 
of sets in V4) need not necessarily be sets in V.,,4), aS claimed. (Note that we can 
prove that Voo+wo does not satisfy the Axiom of replacement by mimicking this proof.) 


We will, however, show that the Axiom of replacement holds true in Vog. 


Proposition 31.19 The set Vo, satisfies the property described by the Axiom of replace- 
ment. 


Proof: 


Given: That w € Vo and f : w — V is a function mapping w into V. 

Required to show: That f[w] € Vi. 

Since w € Voo = U{Vn : n © wo}, then w E€ Vy for some k € wo and so w is a fi- 
nite set. Since f is a function and w is finite, then f[w] is a finite set. Say, f[w] = 
{bo, b1, b2, ..., bm}. Let 6 = lub{rank(b;) + 1 : i = 0 to m}. Then £ is the least ordinal 
such that b; € Vranko, +1 C Vg, for i = 1 tom. Then f[w] = {bo, b1, b2,..., bm} C Ve. 
Then f[w] = {bo, 61, b2,..-,0m} E A(Ve) = Vesi E Vay, as required. 


We have just shown that Vos is a model of the Axiom of replacement. 


Axiom of choice. The Axiom of choice states that if A is a set of sets, then there 
exists a function f : A —> U{a: x € A} which maps each subset x of A to an element 
of yz € x. Since F = V = U{V, : n € wo}, then we know that the Axiom of choice 
holds true on V. We now show that provided the ordinal y is a limit ordinal, then 
the Axiom of choice also holds true on V}. 


Proposition 31.20 Let y be a limit ordinal. Then the set V} satisfies the property de- 
scribed by the Axiom of choice. 


Proof: 


Recall that: 
wo2 = {1,2,3,...,wo,wo +1,...,wo+n,...,}, 
wo3 = {1,2,3,...,wo,..-,wo2,wo2+1,...,wo2+n,...,}, and 


wowo = {1,2,3,...,wo,...,wo2,...,wod...,won,..., } 
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Given: That y is a limit ordinal and that the Axiom of choice holds true on V. 
Required to show: That the Axiom of choice holds true on V}. 


Let U € Vy = U{Va : a E€ y} where U # Ø and Ø ¢ U. We are required to show that 
there exists a function f € V, mapping U onto a set f[U] € V} such that f(x) € x for 
each x € U. 


Since y is a limit ordinal and U € Vy = U{Va : a € y}, then U € Vg for some ordinal 
Gey. Since Vg is transitive, 
Ue Ve => UC Ve 
=> x€ Vg for each x € U 
=> xC Vg for each x € U 
=> U{z:x eU} C Vg 
Now U and U{x: x € U} C Vg are also elements of V. Since the Axiom of choice holds 


true on V, then there exists a function f € V, f : U > U{a : x € U} mapping each 
element x of U to an element yz € x € Vg. We are required to show that f € V}. 


See that f[U] = {yz : £ E U} C Vg; hence, f[U] E A(Vg) = Vari € Vy. 

Claim: f € Vy. 

We have seen that if x € U, then ys = f(x) € Vg € Vy. Recall that the ordered pair 
(x, yx) is defined as {{x}, {x, yx}}. Given (x, yx) € f, 


£T, Yr E Ve > {5 Ye} € Ve 
=> {£, yc} E P(V~) 
= {x, yz} z Ve4i 


Similarly, x € Vg implies {x} € Vg41. For the same reasons, 


{z}, {x, Yz} € V+ > (£, Yx) = {Hr}, {x, Yst) € Vo+2 


Then f = {(x, yz) : £ E U} C Vg42. This implies f € Vgi3 € Vy, and so f € V} as 
claimed. 


We have shown that there exists a function f € Vy mapping U onto a set f[U] € V} such 
that f(x) € x for each x € U, as required. 


We have shown that if 7 is a limit ordinal, Vg is a model of the Axiom of choice. 


Axiom of construction. We now show that for any ordinal a, the Axiom of construc- 
tion also holds true on Vg. 


Proposition 31.21 Let œ be an ordinal. Then the set Va satisfies the property described 
by the Axiom of construction. 
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Proof: 


Given: That qa is an ordinal. 

Required to show: That the Axiom of construction holds true on Va. 

Let T € Va and ¢ be a formula in the language of set theory. Then rank(T) < a (by 
theorem 31.12 b)). Let B= {x € T : ¢(x)}. Since B C T, rank(B) < rank(T) < a. By 
theorem 31.12 c), B € Va, as required. 


31.6 


We now gather together the above results on those Vą’s which are models for the 
ZFC-axioms. If we step back a bit, we obtain the following picture: 


a) Vo is a model for all ZFC axioms, except for the Axiom of infinity. 


b) For the limit ordinal 8 = wow , or B = wo +wo, Vg is a model for all ZFC axioms, 
except for the Axiom of replacement. 


It can then be shown that the Axiom of infinity cannot be logically derived from the 
other nine axioms. That is, it is independent of the other axioms. This is of course 
what we want. It is just that, quite often, given a set of axioms, we don’t always know 
for sure whether a particular axiom is independent of others in this set. 

Similarly, for the limit ordinal, G = wow , we have shown that nine of the ten ZFC- 
axioms hold true in Vg. It can be shown that the Axiom of replacement is independent 
of the other nine axioms. That is, it is impossible to prove the Axiom of replacement 
from the other axioms. 


The GCH and the cumulative hierarchy. 


Note that neither the Continuum hypothesis (CH), nor the Generalized continuum 
hypothesis (GCH), play a role in the construction of V. Similarly, the proof showing 
that Z = V does not invoke CH nor GCH. That is, Z = V neither assumes nor 
negates CH and GCH. 


We pause to reflect a bit on this matter. Recall that the Generalized continuum 
hypothesis declares that for any infinite set S, there does not exist a set T such that 
S CT C P(S) where the cardinality of T is strictly larger than the cardinality of S 
and strictly less than the cardinality of A(S) (recalling that “cardinal numbers” are 
initial ordinals). Equivalently, 


GCH © [Nai = 2%, Vac O] 


Suppose we don’t assume GCH. Then it may be the case, for example, that No < Ni < 
No < Ng = 280, where N4 is the least ordinal such that Ny £e No, No is the least ordinal 
such that N2 Że N1, and Nz is the least ordinal such that Ng £e Xe. Will these “extra” 
sets, N1, No, for example, be present in some Va? We have wo < wy < we < w3 = 29, 


A. 
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where wo, w1, w2 and wg are the corresponding initial ordinals. We have shown above 
that rank(w,) = w;. That is, the least a such that wı C Va is wı. In this case, 
wi C Vy, while w1 € Voo- This means w1 E€ A(V,,) = Vu,41 and so w; appears in 
Vay41 C V = UneoVe. Similarly, w2 E Voz+1 C V. So whether we assume GCH or 
not, all sets in .Y are accounted for in V. So the equality, .Y = V, is not sensitive to 
assumptions made on CH or GCH. 


Concepts review: 


1. 
2. 


10. 


11. 


12. 


State the Axiom of regularity. 


What does it mean to say that a set S has a minimal element with respect to €? 


. Which statement which refers to a minimal element of sets is equivalent to the Axiom 


of regularity? 


. What does it mean to say that a set is well-founded? 


. If we assume the Axiom of choice, which ZFC axiom is equivalent to the statement 


“Every set is well-founded”? 


. Given a non-empty set x, what is the transitive closure of x? 
. Given a non-empty set x, does x necessarily have a transitive closure? 
. How is the class of sets V = Uaeg Va constructed? 


. The class {Va : a € @} has a strict linear ordering with respect to which order 


relation? 


The statement “The class {Va : a E€ @} contains all sets” is equivalent to which ZFC 
axiom? 


What is the rank of a set U? 


If a is an ordinal, what is its rank? 


EXERCISES 


1. Describe the transitive closure t, for each of the following sets x. 
a) x = {1,2} 
b) z = {0, {2,3}} 
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c) x = {0,1,2} 
d) z = {{Ø}} 


. Show that a set S' is transitive if and only if for every z € S, £N S = x. 
. Show that x is transitive if and only if x is its own transitive closure tg. 
. Show that the V,’s in the expression V = Uac Va are sets. 


. Show that if y € O, Useya = sup {a : a € 7}. 

. Write out all the elements of each of the sets Vo to V5. 
. Describe V, and V,,+. 

. What is the least ordinal a such that 7 C Va? 


. If x is a set, then we define the rank of x, rank(x), as follows: 


rank(x) = least{a € Ê : x C Va} 


What is the rank of the set N? 

If the sets A and B have the same rank what is the rank of A U B? 

Can you find a set S such that the rank of Uaes A equals the rank of S? 

Can you find a set S such that the rank of Uaes A is an element of the rank of 
S? 

Show that the Axiom of infinity cannot be proven from the other ZFC-axioms. 


a 
b 
c 


) 
) 
) 
d) 


Part IX: More on axioms: Choice, regularity and Martin’s axiom 361 


32 / Martin’s axiom 


32.1 


32.2 


Summary. In this section we define the “countable chain condition” on a par- 
tially ordered set (P,<). We then define those subsets of (P,<) called “filters”. 
We introduce an axiom which is independent of ZFC called Martin’s axiom, of 
particular interest when =CH is assumed. We then list a few consequences of 
this axiom. 


Introduction 


At this point we have discussed 9 basic set theory axioms we called the ZF-azioms. 
To these 9 axioms we have adjoined a tenth axiom called the Axiom of choice. When 
viewed together these axioms are referred to as the ZF'C-axioms or ZF+Choice. Most 
mathematicians view these ten axioms as constituting a firm and reliable foundation 
of mathematics. A few mathematicians or logicians, as well as certain philosophers 
of mathematics, continue to investigate these axioms identifying and analyzing what 
they consider to be some of their shortcomings or weak points, occasionally question- 
ing the validity of some of these. And this is fine, since no one can prove that the 
ZFC-axioms will not, at some point in time, lead to some contradiction. Many mathe- 
maticians view some other axioms, independent of these, as being useful tools to prove 
certain mathematical statements they consider to be important. Examples of these 
are the Continuum hypotheses axioms: CH, GCH, ~CH and =GCH. The Continuum 
hypothesis, CH, declares that the smallest cardinal number which is larger than the 
countable cardinal No is 8; = 2%°. The negation, CH, of CH declares that there 
is at least one uncountable cardinal 8; such that No < Nı < 2%, The Generalized 
Continuum hypothesis, GCH, states that, for any cardinal number Na, the smallest 
cardinal number which is greater than Na is No41 = 2%«. It’s negation declares that 
there are cardinal numbers Na such that Na < Nap < 2%. 


In this section we will discuss another axiom called Martin’s axiom!. This is a slightly 
more advanced topic of set theory since the understanding of some proofs assume some 
basic knowledge of topology on the part of the reader. Before we state and describe 
this axiom we must introduce two notions associated to partially ordered sets, (P, <): 
the countable chain condition in P and special subsets of P called filters. 


The countable chain condition ccc on a partially ordered set. 


Recall that a partially ordered set is a set P on which we have defined a relation < 
which is reflexive, transitive and anti-symmetric. As an example consider the closed in- 
terval X = [0, 1]. Let r(X) be a subset of the power set, A(X), of X defined as follows: 


‘Introduced in 1970 by Donald A. Martin and Robert M. Solovay 
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T(X) = {U € A(X): U # Ø and U is a union of open intervals [0, a), (a,b), (a, 1]}. 
The elements of 7(X) are referred to as the non-empty open subsets of X. If we equip 
T(X) with the relation C then (7(X), C) is an example of a partially ordered set. In 
what follows, \P denotes the minimal element of P, (if P has one). 


Suppose (P, <) is a partially ordered set which may or may not contain AP (the min- 
imal element of P). Let A be a subset of P such that 1) AP ¢ A (if AP exists), 2) 
for any pair u, v € A, u and v are not comparable under < and 3) there does not 
exist an element r € P such that r < u and r < v. When A satisfies simultaneously 
these three properties we say that “A is strong antichain in the partially ordered set 
P”. We normally drop the adjective “strong” and simply say “antichain” when there 
is no risk of confustion. With this in mind, we introduce the following concept. 


Definition 32.1 Let (P, <) be a partially ordered set (which may or may not contain AP). 
If P contains no uncountable strong antichain then (P,<) is said to satisfy the countable 
chain condition’. In this case, we say that (P,<) satisfies the ccc or that (P,<) is a ccc 
partial order. 


32.3 


For example, let X = [0, 1] and (7(X), C) be the partially ordered set defined above. 
We verify that (7(X), C) is a ccc partially ordered set. Suppose A is a strong antichain 
in 7(X). The ccc property means that the open sets in A are pairwise disjoint. We 
index the elements of A as follows: A = {U, : k < Na} for some cardinal Na. From 
each U,, E€ A we choose precisely one rational number q, (we can do this since U,, is 
a union of open intervals). Since |Q| = No then |{q, : K < Na}| < No. It follows that 


|A| < Xo and so A is countable. We conclude that (T(X), C) is a ccc partial ordered set. 


Dense subsets of a partially ordered set (P, <). 


Given a partially ordered set (P, <) there are special subsets of P said to be dense in 
P. We explain what this means in the following definition. 


Definition 32.2 Let (P,<) be a partially ordered set. Let D be a subset of P such that for 
every element p in P there exists an element din D such that d < p. A subset D satisfying 
this property is said to be dense in the partial ordering (P, <). 


1 Although “countable antichain condition” would have probably been a better choice of words to describe 
this property. 
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32.4 


Examples of dense subsets of a partially ordered set. 


Example 1. Let X = [0,1] and (7(X), C) be the partially ordered set defined above. 
We will construct a dense subset 2 of (T(X), C) as follows. Let 


& = {(x—e,x +e) C [0,1]: 2 € (0,1),e€ (0,1)} 


We claim that & is dense in (7(X), C). Let U be an element of r(X). Then U is a non- 
empty open set (read “a union of open intervals”). Since U is non-empty there is an 
x € U such that z is not 0 or 1. Then there exists € such that (v—e, +e) C (a,b) CU. 
(This is a fundamental property of the real numbers.). Then, for every element U of 
T(X), there is an element of & which is a subset of U. So & is dense in (T(X), C) as 
claimed. 


Example 2. Let (P, <) be a partially ordered set. If £ € P we define x! as follows: 
a= (ye Py <a} 


Let u and v be fixed elements of P. We say that u and v are compatible in P if there 
exists t € P (where t is not the minimum element of P) such that t € ul Nut. Let 
Vaw) = {x € P : x is not compatible with u or v}, Wa) = ut N vt and Du) = 
Veuso) U Wav): 


We claim that Duw) is dense in P: Let z € P and y < z. If yl Nut = Ø or 
yl Nvl = Ø then y € Viuv) 80 y E Diu w) and we are done. Suppose, on the other 
hand, that q € yt Nut. Since y < z, q E€ zl Nu! where q < y < z. If ql nv! = Ø then 
q © Vu w) and we are done. If k € q! Nv! then k € ul Nvl. We then have k < z and 
k € Waw) S Diuw) S0 Duw) is dense in P, as claimed. 


A filter in a partially ordered set (P, <). 


A filter is a non-empty subset of a partially ordered set (P, <) which satisfies two 
properties. Those subsets of partially ordered sets which we call filters are defined as 
follows. 


Definition 32.3 Let F be a subset of a partially ordered set (P, <). If F is non-empty 
and satisfies the two properties, 1) If x and y belong to F there exists z in F which is less 
than or equal to both x and y (i.e., F is a filter base or downward directed), 2) if x belongs 
to F and z is less than or equal to an element y of P, then y belongs to F (i.e., F is upward 
closed). A filter in (P, <) is a proper filter if it is not all of P. If x € P then the set of all 
elements above x is called a principal filter with principal element x. Such a filter is the 
smallest filter which contains x. 
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The notion of a filter is often seen in the context of (F(X), C), the power set for 
some non-empty set X ordered by inclusion C. In this case, a filter F in (A(X), C) 
is seen as being a non-empty subset F of A(X) which satisfies: 


1. F is closed under supersets (i.e., if F € F and F CT then T € F) 


2. F is closed under finite intersections. That is, if A,B € F then ANB belongs to 
F. This generalizes to “the intersection of finitely many elements of F belongs 
to F and so is never empty”. 


When the second condition is satisfied we say that F satisfies the finite intersection 
property. 


For example, if X is an infinite set, the subset F = {U € P(X): X —U is finite } C 
P(X) does not contain the empty set, is closed under supersets and satisfies the finite 
intersection property and so is a proper filter in (A(X), C). When a filter is a subset 
of (A(X), C) we will be using the script font, F, to represent it. Note that F is 
proper if and only if Ø g F. 


Martin’s axiom. 


Martin’s axiom is a statement concerning those sets of cardinality x < 2*° which, when 
hypothesized, allows sets of cardinality «x to behave more like those sets of cardinality 
No then those sets which are of cardinality 28°. We begin by defining the following 
statement called Martin’s K-statement, denoted by MA(x). 


MA(«): Let x be an infinite cardinal and (P,<) be a non-empty partially 
ordered set satisfying the countable chain condition. Let 2 = {D € YA(P): 
D is dense in P} such that |2| < x. Then there is a proper filter F C P 
such that, F A D 4 Ø for every set D € 2. 


When referring to a partially ordered set such as (A(X), C) the Martin’s k-statement 
becomes: 


MA(k) statement statement for a power set: Let k be an infinite cardi- 
nal. Suppose “(X) contains no uncountable family of pairwise disjoint 
subsets of X (i.e., A(X) satisfies ccc). Suppose J* = {12 C A(X): 
2 is dense in A(X)}. If |2*| < k then there is a proper filter F C A(X) 
such that, for every set JE Y*, FADF SE. 


We will first show that MA(No) holds true in ZFC. 
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Theorem 32.4 The statement MA(No) holds true in ZFC. 
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Proof: Suppose (P,<) is a non-empty partially ordered. (Note that the ccc property is 


not required for the Martin’s No-statement to hold true.) Let 2 be a family of dense 
subsets of P such that |2| < Xo. Let a € P. 


Case 1: We consider the case where J is empty. We let F = {x € P: x > a} be the 
principal filter with principal element a. Since F intersects every element of 2 then 
MA(No) holds true. 


Case 2: We consider the case where 0 < |2| < No. We can then enumerate the sets 
in 2 as 2 = {Dj, Do, Ds,...,}. Since, for each i, D; is dense in P we can choose 
some dı € Dı such that then dı < a, choose dy € Də such that də < dı < a, and if 
dn < dn-1 < +++ < dy < a choose dn41 € Dn41 such that dn+1 < -- -dı < a (Choice). 
We now let the set F be one that contains all {d;i }i=1,2,3,.., and all elements of P 
which are above dı. That is, q € F if and only if q = d; or q > a > d;i. We claim that 
F is a filter in P. 1) Clearly F is non-empty. 2) If b is in P such that a < b then 
b € F, 3) if b, c belong to F, then either b < c or c < b. Without loss of generality, 
suppose c < b. Then c < c and c < b. Then F is a filter, as claimed. Furthermore, 
F intersects every D; at d;. So F is the filter which satisfies the condition for MA (Xo). 





We can also show that, in ZFC, MA(2*°) is false. 


Before we prove this we state the following well-known facts about the closed and 
bounded interval X = [0,1]. If U € r(X) (where 7(X) is defined above as being the 
set of all non-empty open subsets of X) then the closure of U, denoted by cl(U), is 
a subset of X where X— cl(U) is the the largest open set which contains elements 
outside of U. A subset is said to be closed if and only if its complement is an open 
subset. The simplest example is: cl(a, b) = [a,b], a closed interval X. 


The following are well-known facts about closed and bounded subsets of R. 


Fact #1: If K is a closed and bounded subset of R and F = {F : F is closed in 
K} is known to be a filter in A(X) (i.e, satisfies the finite intersection 
property) then N{F: Fe F} # Ø. 

Fact #2: If (a,b) is an open interval in X = [0,1] and x € (a,b) C [0,1] then 
there exists c,d € X such that x € (c,d) C cl(c, d) = |c, d] C (a, b). 


Theorem 32.5 The statement MA(2%0) fails in ZFC. 


Proof: Let X = [0, 1]. For each x € X let U, = X — {x} (the complement of {x} in 


X). Then Y = {Uz : x € X} C r(X) where |X| = |X| = 2%, For each z € X, 
let 9, = {D € T(X): cl(D) C Us}. We claim that, for each x € X, 2y is dense 
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in (7(X),C). Suppose M € 7(X). Then there exists some D € Y, such that D C 
cl(D) C MOU, € 7(X). Then 2, is dense in the partially ordered (7(X), C) as 
claimed. Then 2 = {Y, : x E€ X} is a set of dense subsets of (7(X), C) such that 
|2| = 20, 


Suppose MA (2*0) holds true for the partially ordered set (r(X), C). Then there exists 
a proper filter F = {F : F € r(X)} such that, for each z € X, F N Y, is non-empty. 
For each x € X, choose Dg € F N Pzr. Since F is a filter, finite intersections of the 
chosen elements {Dy : x E€ X} are non-empty. Then {cl(D,) : « E€ X} must also 
satisfy the finite intersection property. Since Dy E€ Y, then cl(Dz,) C Zy. Since X 
is closed and bounded (compact) then N{cl(D,) : £ E€ X} # Ø. This contradicts the 
fact that 
{cl(Dz):%E X} OMA: ce X}Cn{Uz: ce X}=a 


The source of our contradiction is our supposition that MA(2*°) holds true. We con- 
clude that MA(2®°) does not hold true in ZFC. 


The two theorems above show that the only Martin «-statements which are of interest 
are those where « is such that No < k < 2%0, 


We then state Martin’s axiom as follows. 


Definition 32.6 Martin’s ariom, MA, is defined as being MA(k«) where « satisfies No < 
k < 20, 


Trivially CH = MA. On the other hand, it has been shown (by Solovay and Martin) 
that Martin’s axiom is independent of ZFC and is consistent with ZFC +—=CH . 


32.6 Consequences of Martin’s axiom in topology. 


There are a few equivalent forms of Martin’s axiom. For those readers familiar with 
point-set topology we present a nicely formulated consequence of Martin’s axiom which 
is of interest. The following statement is in fact equivalent to MA. We prove that it 
is equivalent to MA in the Appendix A. 


Theorem 32.7 [MA] Suppose « is an infinite cardinal such that « < 230. If X is a Haus- 


dorff compact space with ccc and {Ug : a < K} is a family of open dense! subsets of X then 
N{Ua Pape Ø. 


'The subset U is dense in X in the topological sense if and only if cl(U) = X 
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Certain readers may find that this topological statement is reminiscent of the Baire 
category theorem where one version is stated as: “If X is a locally compact Hausdorff 
space and J = {Da : a < No} is a countable set of open and dense subsets in X, then 
N{Da : a < No} is dense in X.”4 


Other well-known topological consequences of Martin’s axiom are: 


— MA(%}) implies that “If X is topological space such every closed subset of X is 
a Gs-set then every subspace of X has a countable dense subset.” 


— MA(;) implies that “A product of ccc topological spaces is ccc”. 
— When MA is assumed: “If k is such that No < k < 20 then 2" = 2%0.” 


— A statement which is equivalent to MA: Let k be such that No < x < 2% and 
let X be a ccc Hausdorff topological space such that the subset {x € X : x has 
a compact neighbourhood} is dense in X. If 2 = {Da : a < k} is a family of 
open dense subsets of X then N{D, : a < K} is dense in X. 


Concepts review: 


1. 


2. 


9. 


10. 


When does a partially ordered set satisfy the “countable chain condition”? 


What is an open subset of the closed interval X = [0, 1]? 


. What subset of A([0, 1]) does 7([0, 1]) represent? 
. What is a strong antichain in a partially ordered set (P, <)? 
. What is a dense subset of a partially ordered set (P, <)? 


. What is filter in a partially ordered set (P,<)? What is proper filter? What is a 


principal filter? 


. What does it mean to say that a family of subsets satisfies the “finite intersection 


property”? 


. State the Martin’s x-statement MA(«). 


State MA(«) when it refers specifically to a power set (Y, C). 
What can be said about MA(No)? 


“Tn fact, it’s similarity to MA is such that some may want to refer to MA as an “Enhanced” Baire category 
theorem. 
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11. What can be said about MA(2*°)? 
12. State Martin’s axiom, MA. 


13. State the Baire Category theorem. 
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33 / Ordinal arithmetic: Addition. 


33.1 


Summary. In this section we define the operation of addition on the ordinal 
numbers. We then show its most basic properties and provide a few examples. 


A definition of ordinal addition. 


Just as for cardinal numbers we can define addition of ordinal numbers. Ordinal 
number addition will be very similar to cardinal number addition. Recall how cardinal 
number addition was defined: 


Let S and T be two sets such that S A T = Ø where S is of cardinality «K 
and T is of cardinality A. We define: k + À as the cardinality of SUT. 


Now the ordinality of a well-ordered set S is the unique ordinal which is order isomor- 
phic to S. If we simply transpose the “cardinal addition definition” onto the ordinal 
numbers, then we would obtain: “If (S,<,) and (T, <,,) are well-ordered sets of ordi- 
nality a and (, respectively, thena+(=°"(SUT).” There is, however, some critical 
information missing here. We can only speak of the ordinality of a set in reference to 
a stated well-ordering of that set. We should then begin by defining a well-ordering 


of SUT. 


Definition 33.1 Let (S,<,) and (T,<,,) be two disjoint well-ordered sets. We define the 


relation “< 


a) 
b) 


c) 


” on SUT as follows: 


—SUT 


u < 


—SUT 


v if {u,v} C S and u <, v. 


u < 


— SUT 


v if {u,v} C T and u <}, v. 


u < 


— SUT 


vifuce S, vET. 


Our next step should be a verification that this newly defined order relation actually 
well-orders the union S UT. We express this in the form of a theorem, and leave the 
straightforward proof as an exercise. 


Theorem 33.2 Let (S,<,) and (T, <) be two disjoint well-ordered sets. Then the rela- 


tion <q, 


well-orders the set S U T. 


ye 


Proof: The proof is left as an exercise. 
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We can now define addition of two ordinal numbers as the ordinality of the union of 
disjoint well-ordered sets. 


Definition 33.3 Let a and ( be two ordinal numbers. Let (S, <) and (T,<,,) be two 
disjoint well-ordered sets of order type a and 8 respectively.! We define a + 8 as follows: 


a+ p= CS UT, Seug) 


The stated definition of addition of ordinal numbers a and ĝ applies only to order 
types of disjoint well-ordered sets (S, <) and (T, <,). If we wish to add the order 
types a and 8 of two sets (S,<,) and (T, <„) with non empty intersection, we can 
construct the sets (S x {0}, <,) and (T x {1}, <,) of the same order types each 


equipped with the lexicographical ordering. That is, 


/\ 


(u,0) < 
(u,1) <, (el) if we 


o (v,0) if wae 


We then add the ordinals a and ĝ as defined. 


Theorem 33.4 Let (S, <s), (T, <+) and (U, <p), (V,<,,) be two pairs of disjoint well- 
ordered sets such that 

ord ¢ =a= ord TT 

ord = B = ord yy 


Then (SUT, <sur) =a +8 = (UUV, <yuy). Hence, addition of ordinal numbers is 
well-defined. 


Proof: The proof is left as an exercise. 


33.2 Examples. 


Addition of natural numbers, when viewed as ordinals, should agree with results ob- 
tained when adding natural numbers the usual way. We have already verified that 
this is the case for cardinal numbers. The following example illustrates that this is the 
case for addition of natural numbers if viewed as ordinals. Note that in the examples 
and theorem below, “<” represents the “ordinal inclusion” order relation €. 


' Addition can also be defined inductively as follows: For all a and 8, a) 8 +0 = 6, b) 8 + (a+ 1) = 
(6+a)+1,c) 8 +a =lub{8 +7: y <a} whenever a is a limit ordinal. 
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a) 


Example. Determine the sum 3 + 7 when these natural numbers are viewed as 
ordinals. Also determine the sum 7 + 3. 


Solution: We see that 3 = °?{7,8,9} and 7 = %*{0, 1, 2, 3,4,5,6}. The choice 
of the natural numbers used is arbitrary. The chosen well-ordered set represen- 
tatives are disjoint. See that °“{7,8,9,0,1,2,...,6} =? {0,1,2,...,9} since 
{7,8,9,0,1,2,...,6} (with the ordering defined on unions) and {0,1,2,...,9} 
(with the usual natural number ordering) are order isomorphic. By definition, 


3+7="{7,8,9,0,1,2,...,6} ="*{0,1,2,...,9} = 10 
TE BS O13 Ov = 10 


Example. Determine both sums wọ + 7 and 7 + wo. 
Solution: So that we obtain disjoint well-ordered set representatives we will use 
T %i{0,1,2,3,4,5,6} 
wo = %?{7,8,9,10,...} 


Then, by definition, 


T+wo = %{0,1,2,3,4,5,6,7,8,9,10,11,...,} = wo 
wo+7 = %{7,8,9,10,...,0,1,2,3,4,5,6}=uo +7 


It is worth noting that 7 + wo = wo < wo + 7 and so, even if addition on finite 
ordinals is commutative, this is not always the case for addition of transfinite 
ordinals. 


Example. Show that wo + 1, when viewed as the sum of the ordinal wọ and the 
ordinal 1 = {0}, is the immediate successor, wo , of wo. 


Solution: 


wo + J. = sore (wo x {0} U {(0, ep actattienas 
4 ({0, 1, 2,3,...} U {wo}) 
{0,1,2,3,...} U {wo} 

= woU {wo} 


= + 
= uy 


33.3 Basic properties of ordinal addition. 


Many of the addition properties generalize from finite ordinal addition to transfinite 
ordinal addition. But we should watch out for those properties that do not. 
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Theorem 33.5 Let a, 3 and y be three (non-zero) ordinal numbers. Then: 
a) (a + B) + SY, = Q + (8 + y) (Addition is associative.) 


b) For any ordinal y > 0,a&a <a +y 


c) For any ordinal y, y <a +y 


e a<bß=>y+a<y+6 


f 


) 
) 

d) a<ß=>a+y<ß+y 
) 
) &a+8B=a+y= B =Y (Left term cancellation is valid.) 
) 


g) at+0=a 


Proof: Let A, B and C be three pairwise disjoint well-ordered set representatives of the 
ordinals a, G and y respectively. 


a) We are required to show that (a + 6) + y =a + (8 + y). By definition 


a (AU B) + ord OY 
= or (A U B) U C] (By definition of ordinal addition.) 
=, od [A U (B U C)] (This is justified below at *.) 

— ad A 4 (BUC) 

= AFEN) 


* Justification of °|(AU B) UC] = °"[AU(BUC)]: The two well-ordered sets 
(AU B)UC and AU (BU C) contain the same elements and so are equal as 
sets. To conclude that they have the same ordinality we must show that their 
elements are ordered in the same way. We will proceed by cases: 


1) If both u and v belong to one of A, B, or C, then u < v in (AU B)UC if 
and only if u < v in AU (BU C). 

2) If u € A and v belongs either to B or C, then u < v in (AU B) UC if and 
only if u < v in AU (BUC). 

3) If u € B andv €C, then u < vin BUC and so u < v in (AU B) UC if and 
only if u < v in AU (BUC). 

In all cases the order of u and v is respected in both sets (AU B) UC and 

AU(BUC). So not only are they equal sets, they are order isomorphic and 

so have the same ordinality. 

b) We are given that a + y =°*(AUC) and y > 0. We are required to show that 
a<a+y. Since C is well-ordered and non-empty it contains a least element, say 
x. Then for every y € A, y < x. Then A= Sy = {u € AUB: u < x}, an initial 
segment of AU B equipped with the ordering <4uyg. Then a is an initial segment 
of a + y and so a € a + y. Equivalently, œ < a + y. 


(a+8)+7y 
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c) We are given that a+ y =°(AUC). We are required to show that y < a + y. 
Since both C and AUC are well-ordered they are either order isomorphic or one 
is order isomorphic to an initial segment of the other (by theorem 26.7). If C 
and AU C are order isomorphic, then y = œ + y. Suppose they are not order 
isomorphic. Then a Æ 0 and so C is a proper subset of AUC. If AUC is order 
isomorphic to a subset of C, then AU C is order isomorphic to a proper subset 
of itself. Since no well-ordered set can be order isomorphic to a proper subset of 
itself, then AUC €wo C; hence, a +y €y,soy<aty. 

d) We are given that a < 3. We are required to show that a +y < 8+7. 
Since a < 8 there exists an order isomorphism f : A — B which maps A to an 
initial segment B* of B. Consider the function g : AUC — BUC such that 
g|a = f and g|o is the identity map on C. Note that a < c for alla € A and 
c E€ C and b<c for allb€ B and c € C and so g: AUC — B* UC is an order 
isomorphism mapping AUC onto B* UC. Since °"(B*UC) < **(BUC) =6+7, 
at+y<Bt+y. 

e) We are given that a < 3. We are required to prove that y +a < y+ 8. 
Since a < 8 there exists an order isomorphism f : A — B which maps A to an 
initial segment B* of B. Consider the function g : CU A —> CU B* such that g|, 
is the identity map and and g|, = f. Note that c < a for all c € C anda € A and 
c < b for all c € C and b € B* and so g[C U A] = CU B* is an initial segment of 
CUB. Since °4(C'U A) = (C U B*) < (CUB) =74+6, then yt+ta<y7+8. 

f) We are given that a+ 8 = a + y. We are required to show that 8 = y. 
Suppose 3 < y. Then by part e), a+ 8 < a + y, a contradiction. Similarly, y < 3 
implies a+ y < a+ p, again a contradiction. We conclude that 8 = y. 

g) We are given that a is an ordinal number. We are required to show that a+0 =a. 
Simply see that a+ 0=(AU{ }) =A =a. 


Remark: In part f) of the above theorem we show that “left cancellation” on addition 
applies just like for natural numbers. However “right cancellation” does not work. 
For example, 


2+wo = “*{0,1} + °{6, 7,8,9,...,} 
oe FO bl {6,7,8,9,...,}) 
~d{0,1,6,7,8,...,} 


= Wo 


Similarly, 3 + wo = wọ. But 2 + wo =3+u9 #2 =3. 


1Note that glAUC] = B*UC need not be an initial segment of BUC. So even if B* UC C BUC equality 
rd B* UC) = *4(BUC) is possible. For example, for U = {0, 1, 3,4,5, ...} and V = {0,1,2,3,4,5,...} we 
have both U C V and “$U = wọ = IV. 
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33.4 Addition of limit ordinals. 


When adding limit ordinals, determining the simplest form for the sum requires some 
thought. For example, 


(wy +7) tur = wr + (7+) 


= w1 +w] = w12 


We provide another approach to addition, for cases where the second term is a limit 
ordinal. Recall that given a non-empty subset A of a well-ordered set W, an upper 
bound of A is any element u of W such that a < u for alla € A. Suppose the element, 
s, is the least upper bound of A. That is, s is an upper bound of A, and, for any 
upper bound u, s < u. In this case we write s = lub(A) (or sup A). 


For example, the ordinal 5 = {0, 1, 2, 3, 4} has least upper bound, lub(5) = 4, since 4 is 
greater than or equal to all of the elements of 5 and it is the least of all upper bounds. 
The limit ordinal wọ = {0,1,2,3,...,} has as a least upper bound, lub(wo) = wo, 
itself. Note that in this case, lub(wo) is not an element of wo. In fact, œ is a limit 
ordinal if and only if lub a = a ¢ a. 


Another property characterizes limit ordinals. The ordinal y is a limit ordinal if and 
only if Uaeya = y. In the case where y has an immediate predecessor, say, 3, then 
y = {0,1,2,3,..., 8} and so 


Uacya =0U1U2U--- U8 =£ 
So even if it is always true that lub y = Uge,a, we have luby = Uge,a = y only in 


the cases where y is a limit ordinal. 


For example, the least upper bound of w1 is wı while the least upper bound of 
wi +3 = {0,1,2,...,w1, w1 + 1,w1 + 2} is wy +2. We now show a useful prop- 
erty involving addition of limit ordinals. 


Theorem 33.6 Let @ be a limit ordinal. Then, for any ordinal, a, 


a+ @8=lub{at+y:7< p} 


Proof: 


We are given that ( is a limit ordinal and a is any ordinal. 
We are required to show that a+ £ is the least upper bound of the set {a +y : y < 8}. 
We claim that a + 8 is an upper bound of the set {a+ y: y < P}: 


— For 6 < 8, by theorem 33.5 part e), œ +ô < a + 8. Soa+( is an upper bound of 
the set {a +y : y < 8} as claimed. 


We claim that œ + @ is the least such upper bound: 
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— Suppose 6 is any upper bound of the set {a +y : y < B}. Then a + y < 6 for 
all y < 8. Suppose 6 < a+. Then for all y, a+y<6< a+. Then there 
exist a least ordinal u € 8 such that ô < œa + u < a + p. Since £ is a limit ordinal 
pt < B, then ô < a+u* < B. This contradicts the fact that a+y < ô for all y < 8. 
Then ô > a+. So a+£ is the least such upper bound of {a+y : y < 8} as required. 


Example: Compute the sum (wo + 7) + 100 + (2 + wo) to its simplest form. 


Solution: 


(wo + 7) + 100 + (2 + wo) wo + 7 


wo +7 


ROFI a 

+ (102 + wo) 

wo + 7) + lub {102 +n : n € wo} 
wo + 7) + wo 

wo + (7 + wo) 

wo +lub{7 +n: n € wo} 


= wo +wo 


( 
( 
( 
( 


NE NL Re Rey 


Concepts review: 


1. Given two disjoint well-ordered sets (S, <s) and (T, <r) define a well-ordering on 
SUT. 


2. For any two ordinals a and 8 how is a+ 8 defined? 


3. For which ordinals does the given property hold true. 


a+bp=a+y>p=y7 
g) a+0=a 


4. If 3 is a limit ordinal simplify the expression sup {a + y : y < 8}. 
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EXERCISES 


1. Suppose 4+ 8 = 8. Prove that wo < 2. 
2. Show that ifa<y,thena+1<y. 


3. Suppose that a and 6 are ordinals such that a < 6. Show that there can only be one 
ordinal 8 such that a+ 6 = ô. 

4. Compute or simplify the sum (50 + wo) + (wo + w1). 

5. Show that if a is a finite ordinal and y is a limit ordinal, then the least upper bound 
of a+y7 is y. 

6. Show that for any ordinal a and limit ordinal y, a+ y is a limit ordinal. 

7. Provide a concrete example of ordinals such that a < 8 and a +y = 8 + y simulta- 
neously hold true. 


8. Let (S,<,) and (T,<,) be two disjoint well-ordered sets. Show that the relation 
<. yr well-orders the set SUT’. 


9. Let (S,<,), (T, <+) and (U, <y), (V, <,,) be two pairs of disjoint well-ordered sets 
such that 


ord ¢ =Q = ord (7 
ord = B = ord yy 


Show that °"(SUT, <sur) =a+8= (UUV, <uurv). Hence, addition of ordinal 
numbers is well-defined. 
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34 / Ordinal arithmetic: Multiplication and Exponentiation. 


Summary. In this section we define the “lexicographic ordering” of the Carte- 
sian product of two well-ordered sets. We then define the multiplication of two 
ordinals a and B as a x B = °"B x A where A and B are their respective set 
representatives. We then list a few basic properties of ordinal multiplication. 
This is followed by a definition of exponentiation and a presentation of a few 
exponentiation properties. 


34.1 Well-ordering the Cartesian product of well-ordered sets. 


The definition of multiplication for cardinal numbers will serve as a model for the 
definition of multiplication on ordinal numbers. Recall how cardinal number multi- 
plication was defined: 


Let S and T be two sets where S is of cardinality « and T is of cardinality 
A. We define: k x A as the cardinality of S x T. 


We cannot simply substitute the words “set” with “well-ordered set” and “cardinality” 
with the word “ordinality” since the ordinality of a set is always expressed in terms 
of a well-ordering on that set. We will order the elements of S x T lexicographically. 
We have discussed this ordering before, but since it is essential in the definition of 
ordinal multiplication we define it formally. 


Definition 34.1 Let (S,<,) and (T,<,,) be two well-ordered sets. We define the lexico- 
graphic ordering on the Cartesian product S x T as follows: 


S1 <s $2 
(s1, t1) Ssyr ($2, t2) provided or 
S1 = 892 and ti <y tə 


Theorem 34.2 Let (S,<,) and (T,<,,) be two well-ordered sets. The lexicographic or- 
dering of the Cartesian product S x T is a well-ordering. 


Proof: The proof is left as an exercise. 


Theorem 34.3 If the well-ordered sets, S4 and S2, are order isomorphic and the well- 
ordered sets, Tı and Tọ, are order isomorphic, then the lexicographically ordered Cartesian 
products, Sı x Tı and S2 x T2, are order isomorphic. 
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Proof: 


We are given onto order isomorphisms f : S1 > S2 and g: Ti > To. 
We are required to produce an onto order isomorphism h : S1 x Ti > S2 x To. 
We define the function h : S1 x Ti > S2 x To as h(s,t) = (f(s), g(t)). 


We show that h is a well-defined one-to-one function on S1 x Ti : Since 


h(s,t) =h(a,b) = (f(s), 9(t)) = (f(a), g(b)) 
= f(s) = f(a) and g(t) = g(b) 


s=a4 and t= b (Since both f and g are one-to-one.) 


then h is one-to-one. (D A 

The function h is onto S2 X Tə : If (s, t) € S2 x To, then, since f and g are “onto” S> and 
Tə respectively, s = f(a) and t = g(b) for some a € Sı and b € Tı; hence, h(a, b) = (s, t). 
Hence, h is onto So x To. 


=> 
=> 


The function h respects the ordering of the sets: 
81 <s $2 
(81, t1) Ssxr (82,t2) © or 
S, = S2 and ti Sa tə 
f(s1) <s f(s2) 
=> or 
f(s1) = f(s2) aa g(t) <p g(te) 
(f(s1), 9(4)) <sxr (F(s2), g(t2)) 
> or 
(f(s1), g(t1)) < (f(s), g(t2)) (Equality <= (s1,t1) = (s2, t2)) 


So h is order isomorphic. 


34.2 A definition of ordinal multiplication. 


We are now set to define ordinal multiplication. At least for finite products, multipli- 
cation is closely linked to addition. For example, 3 x 2 = 3+3 =2+2+2. We would 
expect multiplication of ordinals to be so that wo x 2 equals wọ + wo, for example. We 
propose the following definition. 


Definition 34.4 Let a and ( be two ordinals with set representatives A and B respectively. 
We define the multiplication, a x (, as: 


ax B=%™(B x A) 


The product, a x 8, is equivalently written as, aß, (respecting the order). Note the order 
of the terms in the Cartesian product, B x A, is different from the order, a x (@, of their 
respective ordinalities. 


Part X: Ordinal arithmetic 381 


a) Example. Compute both wo x 2 and 2 x wo. 


wax 2. = “Ctx N) 
et COO (0, 1), (0,2), ..., (0,n),..., (1,0), (1, 1), (1,2), ..., (L n), .--} 


= wo + wo 


~1(N x {0, 1}) 
= %1{(0,0), (0, 1), (1,0), (1, 1), (2,0), (2, 1), ..., (n, 0), (n, 1),...,} 


— wo 


2 X wo 


We see that this multiplication is not commutative. At least in one case of wọ x 2 it 
is linked to the notion of sums. 


b) Example. Order the following pairs. 
i) ((wo, wo x 2), 7) and ((wo, 2x wo), 2 +wo), 
ii) (wo, (wo x 2, 7)) and (wo, (2 x wo, 2 + wo)). 


Solution of i): We compare ((wo, wo x 2), 7) and ((wo, 2 x wo), 2 + wo): 
We first compare the two first coordinates (wo, wo xX 2) and (wo, 2 x wo). 


(wo, wo X 2) = (wo, wo +wo) and (wo,2 x wo) = (wo, wo) 


=> 
(wo 2 x wo) < (wo, wo X 2) 
=> 
((wo, 2 x wo), 2 + wo) < ((wo,wo X 2), 7) 


Solution of ii): We compare (wo, (wo x 2,7)) and (wo, (2 x wo, 2 + wo)): 
(wo x 2,7) = (wo +w0o,7) and (2X wo, 2+ wo) = (wo, wo) 


=> 
(2x wo,2+wo) < (wox 2,7) 
=> 
(wo, (2 x wo, 2 +wo)) < — (wo, (wo x 2,7)) 


34.3 Basic properties of ordinal multiplication. 


We show that some of the multiplication properties generalize from finite ordinal mul- 
tiplication to transfinite ordinal multiplication. 
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Theorem 34.5 Let a, 6 and y be three ordinal numbers. Then: 


a) (yB)a = (Ba) (Multiplication is associative.) 


b 


For any y > 0, a < b => ya < yf 


Cc yla + B) = ya + yB (Left-hand distribution is acceptable.) 


e) y0=0 


) 
) 
d) For any y > 0, ya = 78 >a =$ 
) 
) 


(Left-hand cancellation is acceptable.) 


f) For any limit ordinal 8 4 0, aß = lub {ay : y < 8} 


Proof: Let A, B and C be three pairwise disjoint well-ordered set representatives of the 


ordinals a, G and y respectively. 


a) We are required to show that (A x B) x C is order isomorphic to A x (B x C). 


(ax p)xy = 


(Bx A) x "iC 
(C x (Bx A)) 
(C xX B) x A) (This is justified below at *.) 
ord 4 x G x B) 


ax (Gx 7) 


* That the sets C x (B x A) and (C x B) x A are equipotent follows from theorem 
4.9. We now show that the one-to-one function f(a, (b,c)) = ((a, 6), c) which 
maps A x (B x C) onto (A x B) x C respects the ordering: 


es a Wed { in Ax (Bx C) : — (a, (b, )) < (c, (d,e)) 
in (Ax B)xC: (a,b) < (c,d) =>  ((a,b),c) < ((c,d),e) 


— = 
m =m 
—_— er 
arg eo Q 
I i Ag | 


d 
< e 


Ti i { in Ax (BxC): (b,c) <(d,e)= (a,(b,c)) < (c, (d,e)) 
d in (Ax B)xC: (a,b) <(c,d)=> _ ((a,b),c) < ((c,d), e) 


5 { in Ax (BxC): (b,c) <(d,e) => (a,(b,c)) < (c, (d,e)) 
in (A x B) xC: -m ((a, b), c) < ((c,d),e) 


We have considered all cases and see that the ordering is respected. 


b) We are given that y > 0 and a < 8. We are required to show that ya < yf. 
Since a < ĝ, there exists an order isomorphism f : A — B mapping A to an initial 
segment in B. It suffices to show that C x A is isomorphic to an initial segment of 
C x B. Define the function g : C x A— C x B as follows: g((c, a)) = (c, f(a)). 
Since f is one-to-one into B, g is easily seen to be one-to-one. 
We show that g respects the order: 
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— Suppose (u,v) Soxa (5, #). 

—Ifu=s, then v <4 t and so (u, f(v)) <o. 5 (u, f(t) = (s, f@) 
— If u <c s, then (u, f(v)) <oxg (s, fd). 

— So g respects the order. 

c) The left-distributive property proof is left as an exercise. 

d) We are given that y > 0 and ya = y8. We are required to show that a = £. 
Suppose not. Suppose, without loss of generality that a < 8. Then by part b), 
ya < yß, a contradiction. So a = ĝ, as required. 

e) We are required to show that y x 0 = 0. 

Simply note that y x 0 ="({ }x C) ="*{ } =0. 


f) We are given that (@ is a limit ordinal not equal to 0. We are required to show that 
ab = lub {ay : y < p}. 
We claim that aĝ is an upper bound of the set {ay : y < 3}: 


— Note that y < 6 = ay < aß (by part b). So aĝ is an upper bound of the set 
{ay : y < 3} as claimed. 
We claim that if ô is an upper bound of the set {ay : y < 8}, then ap < ô: 
— Let 6 be an upper bound of the set {ay : y < 8}. 
— We claim that 6 > alub{y: y < p}: 
x Suppose ô < alub {y : y < 8}. 


ô< alub{y:y <8} > Scala 


<8 
> d€ay For some y < 2 
=> d6<ay 


x But 6 < ay contradicts the fact that ô is an upper bound of the set {ay : y < 
6}. 
x So ô > alub{y : y < 8} as claimed. 
— Since @ is a limit ordinal, then 8 = lub {y : y < 8}. 
— So ô > aß as claimed. 
Combining the two claim statements, we obtain a8 = lub {ay : y < 8}. 


34.4 Examples. 


When performing ordinal arithmetic it is not always obvious how to simplify expressions. 
In the following examples we show how some of the properties shown above can be used to 
simplify expressions. 
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a) 


By mathematical induction, show that for any ordinal a and finite ordinal n < 0 
an=at+act+-::--+a (n times) 


Solution: Induction on the natural numbers. Let A be a set such that °*A =a. 

Let P(n) denote the statement “an = a+a+---+@ (n times)” 

Base case: Since al = °*"*({0} x A) =°*A =a, P(1) holds true. 

Inductive hypothesis: Suppose P(n) holds true. Then, by left-hand distributivity of 
ordinals, a(n + 1) =an+al=an+a=a+a+-::+Q (n+1 times). By mathematical 
induction an = a + a +- +Q (n times) for all non-zero finite ordinals n. 


Define a +a +a +- -++ (Countably infinite times) = lub{a, a +a, at+a+a,...}. Show that 
for any a, 
QwWo =Q +++- + (Countably infinite times) 
Solution: 
awo = lub{an:n < wo} 
= lub{a, a2, a3,a4,...} 
= lub{a, ata, at+a+ta,...} 


= Q + Q + Q + * ++ (Countably infinite times) 


Show that mwo = wo for any non-zero finite ordinal m. 


Solution: 
mug = lub{mn:n< wo} 


= Wo 


We define we = wow. Express we = wow as an infinite sum illustrating what this means. 


Solution: 
lub {won : n < wo} 


wowo 


lub {wo, wo + wo, wo + wo + wo, . .-} 


Wo -+ Wo + Wo + * ++ (Countably infinite times) 


34.5 Some ordinal comparisons. 


The following table may be of some help in seeing how countably infinite ordinals are 
ordered. The ordinals are increasing with respect to € as we go down the table. All 
the ordinals in the third column are limit ordinals. There is no end to this process, 
so there is an unlimited source of ordinal numbers. 
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34.6 





0 < 0,1,2,3,... < wo 

wo < wo, wo + 1, wo +2, wo +3,..., < wo + wo 
wo2 < wo2, wo2 +1, wo2+ 2, wo2 +3, ..., < wo2 + wo 
wo3 < wo3, wo3 +1, ..., wo4, ..., wod, ..., < wowo = we 

wowo =we < Derde harn we+wo, ..., ..., we +wowo = wi+tuwe = w2 

we2 < we2, wR24+1 ..., wh3, ..., wed, ..., wewo = we 

we < Wee We EV eaten Was, shah Won Sages a: < wo? 

wy? < wo, wer +1, ..., (WEP), «2, (U PO), aay Wi,- 


Many of the ordinals listed above may seem incredibly large. In spite of this, note 
that every one of these ordinals is countable! 


Observe that we go from one limit ordinal to the next by adding a countably infinite 
set, remembering (from theorem 19.6), that adding a countably infinite set to some 
infinite set S does not change the cardinality of S. We see this in more detail in the 
following table. 





wo E W92 m W093 > +: > we 
>w eV Se ot pO ee ee? ey 
0 0 0 0 
=> wi + wo +1 wi Hw +2 +++ > wi + w92 
— we + we = w2 > --- > uu = 8 


Sug +1 =ug+23--- 49 


Recall that all of these are elements of the first uncountable ordinal is wı (defined as 
the least ordinal which cannot be embedded in wo). The ordinal w; is uncountable. 
It is not constructed (other than being the union of all countable ordinals), but its 
existence is guaranteed by Hartogs’ lemma (see 28.9, 28.10 and 28.11). 


Ordinal Exponentiation. 


We now define, by transfinite recursion, exponentiation of ordinals . 


Definition 34.6 Let y be any non-zero ordinal. We define the y-based exponentiation 
function gy : Ê — @ as follows: 


1) 
2) 


3) 


g(0)=1 


g(a) = gyla)y 


g(a) = lub{g, (8) : 8 < a} whenever a is a limit ordinal. 
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Whenever y # 0 we represent g(a) as y*. Then y°*! = y°y. If y = 0 we define 
y* = 0% = 0. 


Many of the principles used in the computation of expressions involving exponentia- 
tion of natural numbers extend to large ordinal numbers. We begin with expressions 
involving inequalities. 


Theorem 34.7 Let a, 6 and y be three ordinal numbers. Then, assuming y > 1, 
a< Bagal 
Proof: In what follows, 1 < y. 


(= ) Proof by transfinite induction. Let P(8) denote the statement “a < 8 > 7% < qP”. 
Then P(0) holds (vacuously) true. Suppose P(ô) holds true for all 6 < 8. That is, 
suppose a <6 > 7%% < 7° for any ô < $. 

Case 1: 8 is a successor ordinal. That is, @ = u + 1 for some ordinal u. Suppose 
ô< B = u+ 1. It suffices to show that 7° < 7°. 


=u > =" 
> 4°(1) < yy (By theorem 34.5) 
> q? < qHH = yÊ (By definition) 
â< u = y? ca y” (By induction hypothesis)) 
= 7? < (1) < gy =T = 9? 
In both cases, 5 < 8 implies 7° < 4ô. 


Case 2 : B is a limit ordinal. That is, 6 = lub{d : 6 < 6}. We are given that 
< u< B= 7? <7". Suppose ô < 8. It suffices to show that 7° < ~ê. 


6<B => 6<641< 
> q? < noth < lub{+? 0 S00 yÊ (By 3) in Definition 34.6) 
= <f 


In both cases 1 and 2, 6 < 3 implies 7° < 7°. 


( = ) Suppose 7° < yÊ. If B < 6 then by the first part above we have both y° < 4° 
and 7° < 7°, a contradiction. Then ô < 6. Since 5 = G implies 4f = y°, we must have 
ô < G, as required. 
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Theorem 34.8 Let a, @ and y be three ordinal numbers where a Æ 0. 


a) yy = 7 


B+a 


b) (78) = 2 


Proof: 
a) 


Proof by transfinite induction. Let P(5) denote the statement “yg? = y+, 
Then P(0) holds true. Suppose P(d) holds true for all ô < a. That is, suppose 
yÊ’ = yt’ whenever ô < a. 
Case 1 : a is a successor ordinal. That is, a = u + 1 for some ordinal u. Then 
PE E Pe 
=at 
Sayre 
= ny B+n+1 
= PHC) 


= nba 
Case 2 : a is a limit ordinal. That is, a = lub{ô : 6 < a}. We are given that 


84> = 8+ whenever ô < a. We are required to show that y?7% = fte, We 
know that y% = lub{7°:6 < a}. 


Hoy (By the inductive hypothesis) 





We claim that Y~ must be a limit ordinal: Suppose not. Then y“ = u + 1 for some 
ordinal u. Then u < 7° for some ô < a (for, if not, u < 7% = lub{7°:d < a} < u, 
a contradiction). Then (by 34.7) u+ 1 < 7% < y% = u + 1, a contradiction. So y% 
is a limit ordinal as claimed. 


We claim that y8t+* < P42: 
By definition 34.6 c), Pta = lub{7? p ô < B+a} (“a is limit ordinal” > “8 + a is a limit ordinal”). 
6<Bt+a => 6<P+p<6+<a for some u <a 
> y? < PTH == Pay < Py (ay induction hypothesis and theorem 34.7) 


Hence y+ = lub{y°:6 < B +a} < 77%, as claimed. 


We claim that yPy% < yet: 
Since 4® is a limit ordinal yy% = lub{y°6 : 5 < 7%}. (Theorem 34.5 £) ) 
Ô<% => <q" <q“ for some u < a 
> 5 < Pal = PtH < myP+O oy induction hypothesis.) 


Then yy% = lub{y6 : 5 < y%} < 78+, as claimed. 


B+a 


We conclude that yy% = y°+°, as required. 


A. 
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b) This can be proved by transfinite induction as in part a). Let P(d) denote the 
statement “(y°)° = y®°”. Then P(0) holds true. Suppose P(d) holds true for all 
6 <a. That is, suppose (yf)? = yf? whenever 6 < a. As in part a) consider the two 
cases, 1) a has an immediate predecessor, and 2) a is a limit ordinal, separately. 
The details are left as an exercise. 


Concepts review: 


1. When defining multiplication of ordinals, what kind of ordering is used on the Carte- 
sian product of the ordinals being multiplied? 


2. Define the multiplication of two ordinals a and 2. 


3. How does the definition of ordinal multiplication compare with the definition of car- 
dinal multiplication? 


4. Construct a set representation for the ordinal product w2. 
5. How does the ordinal wo3 compare with the ordinal 3wo? 
6. Is left-hand distribution acceptable in ordinal multiplication? 


7. If we start with the countably infinite ordinal wo and gradually increase its ordinality, 
one at a time, by an endless process of ordinal addition and multiplication, can we 
reach wı with this process? 


EXERCISES 


1. Simplify or describe the following expressions. 
a) wo + wg 
b) (wo +2) + wo 
c) w2 +1 
2. Which ordinal is larger, 3wo or wọ3? Explain. 
3. Does the ordinal we + 3wo have an immediate predecessor? If it does describe it. 


4. Find two ordinals whose ordinal product is wo + wé. 


5. Prove that if ô < aß, then 6 = uy for some u < a and some y < /. 
6. Show that (wo + wo)wo = wowo 


C. 
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7. Prove that al = la=a. 


8. Prove that if œ and 8 are both finite ordinals, then aß = pa. 


9. Prove that if a8 = 0, then either a or 8 is 0. 
10. Prove that y(a+ 8) = ya + y6. 
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A / Boolean algebras and Martin’s axiom. 


Summary. We discuss those partially ordered sets called lattices and Boolean 
algebras. Any Boolean algebra B is then shown to have a topological representa- 
tion B(.SY(B)). Finally characterizations of Martin’s axiom are given in terms 
of these concepts. 


A.1 Lattices. 


Given a partially ordered set (P,<) there may be pairs of elements a, b in P such 
that a and b do not have a common upper bound or a common lower bound in P. 
Those partially ordered sets in which every pair of elements in P have lower and upper 
bounds play an important role in mathematics. We refer to these sets as lattices. 


Definition 0.1 A partially ordered set (P, <) is called a lattice if a V b = max{a, b} and 
a/b = min{a, b} both exist in P for all pairs a, b in P. 


Definition 0.2 If B is a subset of a partially ordered set, (P,<), VB denotes the least 
upper bound of B and AB denotes the greatest lower bound of B (both with respect to <). 
Note that VB and AB may or may not be an element of B. A lattice (P, <) is said to be a 
complete lattice if for any non-empty subset B of P, both VB and AB exist and belong to 


P: 


Examples of lattices. 


1) 


Let X be a set (A(X), C) be a partially ordered set ordered by inclusion. Let A 
and B be any pair of subsets of X (read, A, B € A(X)). We define AVB = AUB 
and AA B = ANB. For Z C A(X), VZ = U{A € P(X): A € B} (read, 
“/ is the union of all subsets, A, of X such that A € Z”). Also define 
AZB = UAC P(X): A € B} (read, “AZ is the intersection of all subsets, 
A, of X such that A € Z”). In this case, with V and A^ defined as U and fM, 
respectively, (A(X), C,U, N) is a complete lattice. 


Let 7(X) be a topology on the set X. Let A and B be any pair of open sub- 
sets of X (read, A,B € 7(X)). We define AV B = AUB and AAB = 
ANB. For Z C A(X), VZ = UJA € T(X): A € B}. Also define AZ = 
intx (N{A € A(X): A€ B}). In this case, with V and A defined as U and f, 
respectively, (r(x), C,U,M) is a complete lattice in (A(X), C). 
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3) Let X be topological space. Let A(X) = {AC X : A is clopen in X}° partially 
ordered by inclusion C. If A and B are clopen subsets of X we define AV B = 
AUB and AAB=ANB. Then (&(X),C,U,N) is a lattice in (A(X), C). 
But it is not necessarily complete. Verify that (4(Q), €,U,N) is not a complete 
lattice. 


Definition 0.3 Let X be a topological space. A subset B is said to be regular open in X 
if B = intx(clx(B)). The set of all regular open subsets of X will be denoted as Zo(X). 


For example, (—5,0)U (0,5) is open in R but not regular open in R. 


A particular example of a complete lattice. Let X be a topological space with a 
Hausdorff topology r(X). By definition, Zo(X) C 7(X). For the partially ordered 
set (Zo(X), C) we define A and V as follows: For A, B € £o(X), 

AAB = ANB 

AVB intx (clx(AU B)) 


If we want to form a lattice (@o(X), C, V,M) in the partially ordered set (7(X),C), 
we cannot define AV B as AUB since it is not true, in general, that intx(clx(A)) U 
intx (cly(B)) € Zo(X). To see this consider, for example, the open intervals A = 
(a,b) and B = (b,c) both elements of Zo(R). See that 


intr(cly(A U B)) = (a,c) # (a,b) U (b,c) = AUB 


Hence #o(X) is not closed with respect to the union, U, of finitely many sets. The 
following statement confirms that (@o(X),C,V,M) is a complete lattice in the par- 
tially ordered set (7(X), C). 


Theorem 0.4 Let X be a topological space. Then (%0(X), C,V,M) is a complete lattice 
in (T(X), C). 


Proof: It immediately follows from our definition of V that, for A, B € Zo(X), AVB€ 
HZo(X). We leave it as an exercise to show that 
AN B= intx(clx(A))N intx(clx(B)) = intx(clx(AN B)) € G(X) 
Suppose 2 C &o(X). Showing that 
AB: BEG = intx(cly(N{B: BE 2})) € LX) 
V{B: BEY} = intx(cly(U{B: BE 9})) € B(X) 


5A set is clopen if it is simultaneously open and closed in X 
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is also left as an exercise. Then (40(X),C,V,M) is a complete lattice in (r(X), C) as re- 
quired. 


A.2 


The partially ordered set Zo(X ) forms a base for a topology on X. Since Ø, X € Ko(X) 
and £o(X ) is closed under intersections, N, then Zo(X) forms a base for some topol- 
ogy. That is, Zo(X) generates some topology T*(X) C r(X). If (X,7(X)) is assumed 
to be Hausdorff, r(X) separates points of X; it then easily follows that Zo(X) also 
separates points of X. We have shown that (X,7*(X)) a Hausdorff on X.6 


Lattice filters. 
We will consider a lattice (L C, V, A). A subset F of (L C,V,A) is called an L-filter 
if 

1) F is non-empty, 


2) F is such that, for non-empty A,B € F there exists D # Ø such that D < 
ANBEF) 


3) F is such that, if A € F and A < C E L then C E€ F). 


When only conditions 1 and 2 are satisfied we say that F is an L-filter base. We 
say that the L-filter base, F, generates the L-filter, #'. If F € L, then {F} is the 
L-filter generated by the singleton {F}. A filter F is said to be a proper L-filter if 
F L. A filter F is proper if and only if Ø Z F. 


Definition 0.5 Let (L, <,V,^) be a lattice. An L-ultrafilter is a proper filter F in L 
which is not properly contained in any other proper filter in L. If the filter F is such that 


AF 


: F E€ F} # @ then we say that the filter F is a fired ultrafilter. Ultrafilters which 


are not fixed are said to be free ultrafilters. 


Theorem 0.6 Let X be a topological space. 


a) 


b) 


Suppose ¥ is a proper L-filter where (L, C, V, A^) is a lattice in (A(X), C). Then F 
can be extended to an L-ultrafilter. 


Suppose ¥ is an L-filter in (L,C,V,A) a lattice in (A(X),C). Then F is an L- 
ultrafilter if and only if for every A C X, either AG F or X-AE F. 


°Given a topological space (X,7(X)) the set (X,r*(X)) is referred to as the semiregularization of 
(X,7(X)). 
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Proof: 


a) Let F be a proper L-filter. Let # = {M : M is a proper L-filter such that F C M}. 
We partially order # with C. Let @ be a chain in (#,C). Then U{C : C € €} is an 
upper bound of @ with respect to C. So every chain in # has an upper bound. By Zorn’s 
lemma, (#,C) has a maximal element. That is, # contains a filter, F*, which is not 
properly contained in any other filter. Since F* € H, F C F*. Then F can be extended 
to an L-ultrafilter, as required. 


b) ( = ) We are given that F is an L-ultrafilter in L C A(X) and that A C X. Suppose 
neither A nor X — A belongs to F. Let X ={BeL: ANF C B for some FE F}. 
Clearly, F C HM and Ac KH — F. Then F is a proper subset of #. Note that S € H 
for, if it was, AN F = Ø for some F and so F C X — A which would imply X- AE F, a 
contradiction. It is a straightforward exercise to show that # is closed under finite appli- 
cations of A. Hence # is an L-filter base which will generate a filter #*. Then F C #* 
which contradicts the fact that F is an L-ultrafilter. 

( = ) Conversely, suppose that for every A C X, either A € F or X— A €E F. We 
are required to show that ¥ is an ultrafilter. Suppose not. That is, suppose that there 
exists a proper filter, #, such that F C #. Then there exists some non-empty A such 
A E€ H — F. Then X — A cannot belong to F, for, if it did, then AN ee A)=2 
must belong to #, contradicting the fact that # is a proper filter. Hence F must be an 
L-ultrafilter. 


Theorem 0.7 Let X be a topological space. Then (%0(X), C,V,M) is a complete lattice 
in (7(X), C). An £0(X)-filter, F, is an Zo(X )-ultrafilter if and only if, for any A € Zo(X), 
either A or X — clx(A) belongs to F. 


Proof: 
( = ) Suppose F is an £0(X )-ultrafilter and A € Zo(X). Let Fe F. Case 1: If FAA = Ø 
then (since F is open) F C X —clx A. Since 


X—clyA intx (X — A) 
intx (X — intxclx A) 


= intxcly (X E clx A) 


F C X — cly A € Zo(X) which implies X — cly A € F. Case 2: Suppose F N A # Ø for 
all FE F. Suppose A ¢ F. Let H = {B E H(X): ANF C B}. Then F C # and 
AEH — FI. As shown in iie theorem above, # is a filter base which generates a filter 
H™* in (Fo(X), C). Since F C H* this contradicts the fact that F is an Zo(X )-ultrafilter. 
So A must belong to F. 

Gee) pubbose 1 that for any A € &o(X), either A or X — clx(A) belongs to F. See 
that the filter F extends to an £0(X)-ultrafilter F*. Suppose A E€ ¥*. If A g F then 
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X—clyA€ F C F* implying that AN(X —clx A) = Ø E€ ¥*, a contradiction. So A E€ F. 
Then F* C F which implies that F is an Zo(X)-ultrafilter. 


It can similarly be shown that a 7(X)-filter, F, is a 7(X)-ultrafilter if and only if, for 
any A € 7(X), either A or X — clx (A) belongs to F. 


A.3 Boolean algebras. 


We now define certain lattices with special properties. 


Definition 0.8 A lattice (L, V, ^) is said to be a distributive lattice if, for any x, y, and z 
in L, ev ye) = (eV yA eV 2) and &A (YV 2). = na) Vv (ene): 


- The lattice (L, V, ^) is said to be a complemented lattice it has a maximum element, 
denoted by 1, and a minimum element, denoted by 0, and for every x € L there exists 
a unique x’ such that x V x’ = 1 and z A x’ = 0. 


- A complemented distributive lattice is referred to as being a Boolean algebra. A 
Boolean algebra is denoted as (B,<,V,A,0,1,’) when we explicitly want to express 
what the maximum and minimum elements are. 


Examples. The lattices (A(X), C,U,N, X, Ø’) and (A(X), C,U,N, X, Ø, ’) are the 
simplest examples of Boolean algebras. In both case A’ = X — A. 


The lattice (Zo(X), C, V,N, X, Ø, ") is also a Boolean algebra, although this is not at 
all obvious. To prove this one must show that, for any A, B,C € £(X), 

- X —clyA = intxcly(X — clx A) € H(X), 

- AN(X-clxA)=2 

- AV (X-— clx A) = intx(clx(AU (X— clxA))) = 

-AV(BNC) =(AV B)N (AVC) and AN(BVC) = (ANB) V (ANC) 


Proving these is left as an exercise. 


Note that in the case where A € Zo(X), A’ = X— clxA. It is erroneous to interpret 
A’ as meaning X — A. 


Like lattices (L,C, V, ^) in A(X) a Boolean algebra (B,<,V,/A,0,1,’) contains B- 
filters and B-ultrafilters. The following concepts and properties are slight generaliza- 
tions of ones we have already seen, so they will seem familiar to readers. 
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Boolean filters and ultrafilters. If (B,<,V,A,0,1,’) is a Boolean algebra then a B- 
filter, F, is a non-empty subset of B which is closed under finite applications of A and, 
for any x E€ F, (x < y) > (ye F). We say that the B-filter, F, is a B-ultrafilter if 
F is a proper filter and is not properly contained in any other proper B-ultrafilter. 


We will emphasize three important B-ultrafilter properties. For any B-ultrafilter, F, 
— whenever x Vy € F, either x E€ F or y E€ F. 


— ifx € B and z ^y # @ foral y € F then y E€ F. 
— for any x € B, either x € F or x’ € F. To see this suppose x ¢ F. 


Proving these is left as an exercise. 


Definition 0.9 Suppose we are given two lattices (B1, <1, V1, A1, 0, 1,” ) and (Bo, <2, V2, ^2, 0, 1,’) 
and a function f which maps elements of Bı to elements of Bə. We say that f : Bı > Bo 
is a Boolean homomorphism if, for any x,y € B, 


1) f(x Vi y) = f(x) V2 f(y), 
2) f(z My) = f(x) ^ f(y) 
3) f(x) = flay’. 


The function f : Bı — Bg is a Boolean isomorphism if f is a bijection and both f and f© 
are Boolean homomorphisms. 


We say that f : Bı — Bə is an order homomorphism if (x <ı y) => (f(x) <2 f(y)). It 
can be shown that: 


1) If f : Bı — Bə is a Boolean homomorphism then f must be an order homomor- 
phism. 

2) If f : Bı — f|Bı] is a bijective Boolean homomorphism then f must be a Boolean 
isomorphism. 


Proving these is left as an exercise. 


Topological representations. We will say that a Boolean algebra (B, <, V, ^, 0,1, ) has 
a topological representation if the there exists a Boolean isomorphism f : B > Z(X) 
mapping B onto A(X) where (A(X), C,U,N)) is the Boolean algebra of all clopen 
sets on some topological space X. 
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Definition 0.10 Let (B,<,V,/A,0,1,’) be a Boolean algebra. Let Y(B) = {V : V isa 
B-ultrafilter}. We define the function fg : B => P(Z(B)) as follows: falx) = {F € 
S(B): cE DM}. 


Theorem 0.11 Let (B,<,V,A,0,1,’) be a Boolean algebra. Then the set {fp(x) : x € B} 
is a base for the open sets of some topology, T(.7(B)), on the set Z (B) of all B-ultrafilters. 


Proof: 


Let Zg = {fa(x): a € B}. We first show that the sets in @g cover all of Y(B). Since 
B-ultrafilters are proper filters no ultrafilter can contain 0; then fg(0) = @ € Zg. Also, 1 
belongs to all B-ultrafilters hence fg(1) = (B) € Zg. Suppose fg(x) and fe(y) belong 
to Zg. Any B-ultrafilter F contains x A y if and only if it contains both xz and y. Then 
F € fp(xAy) ifand only if F € fa(x)Nfe(y). We can then write fa(xAy) = fa(x)N fay). 
Then the set Zg = {fp(x): x € B} C A(.Y(B)) is a base for the open sets of some topol- 
ogy on F (B). 


For a given Boolean algebra (B,<,V,/,0,1,’) we can now speak of a topological 
space (F (B), T(Z(B))) which is associated to B. Its elements are B-ultrafilters. 
When equipped with this topology the set .Y(B) is referred to as the Stone space. We 
now describe a few properties of the function fg which associates B to subsets of the 
topological space .Y(B). We refer to this important theorem as the Stone represen- 
tation theorem. 


Theorem 0.12 Let (B,<,V,A,0,1,’) be a Boolean algebra. 


1) The function fg : B — Y(.A(B)) is a Boolean homomorphism mapping B into 
P(S(B)). 


2) For every x € B, fg(x) is clopen in J (B). Hence fg|B] C A(.A(B)) (the set of all 
clopen sets in Z (B)). 


3) The function fg : B — (B) is a Boolean isomorphism mapping B into A(.Y(B)) 
(the set of all clopen sets in .7(B)). 


4) The topological space (F (B), T( Z (B))) where r(.Y(B)) is the topology generated by 
the open base {fp(x) :  € B} is a compact zero-dimensional Hausdorff topological 
space. 
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The Boolean isomorphism fg : B —> (B) maps B onto A(.Y(B)) (the set of all 
clopen sets in .Y(B)). 


Proof: 
We are given that (B, <, V,A,0,1,’) is a Boolean algebra. 


1) 


We have shown in the previous theorem that falx A y) = fe(x) fa(y). 

We now show that falx V y) = fa(x)U fa(y): If F € fe(x)U fey) then either 
x or y belongs to F. By definition of a filter, x Vy E F, hence F € fp(x Vy). 
Then f(x) U fa(y) C fa(a V y). On the other hand, if F € fge(x V y), then 
xVy€ F. By a property of B-ultrafilters described above, either x or y belongs 
to F. Hence F either belongs to fp(x) or to fe(y). Then fa(aVy) C fa(x)U fa(y). 
So fg(z V y) = fa(x) U fay). 

We show that f(a’) = fgl) = 7(B)—fp(x): We know that fp(x)Nfp(2’) = fg(£^ 
x’) = fp(0) = Ø. We also know that fg(x) U fa(2’) = fgle V 2’) = fe(1) = A(B). 
We conclude that fp(2’) = Z (B) — fp(z). 

From this we conclude that fg : B— A(.Y(B)) is a Boolean homomorphism. 

We are required to show that, for every x € B, fg(x) is both an open and closed 
subset of (B) (with respect to the topology generated by the base {fp(x) : x € 
B}). For each x in B, fg(x) is open in (B) (since it is an open base element). 
Since fp(x’) = fp(x)’ is open and fa(x) = (B) — f(x)! (shown above), then 
f(x) is also closed. So every element of {fp(x) : « € B} is clopen. It follows that 
fe[B] = {fa(x): x € B} CAS(B)) € A(SA(B)). 


Having already shown that fg : B — fs|B] C B(Y(B)) is a homomorphism. To 
show it is an isomorphism it suffices to show that fg is one-to-one. 


The function fp is one-to-one: Suppose x and y are distinct points in B. Since x Æ y 
then one of the two statements x < y, y < x must be false. We will assume, without 
loss of generality, that x £ y. We define x — y = xz ^A y'. We claim that since x £ y 
then x — y #0. For suppose that x £ y and x Ay’ = 0. See that 


g=a’vV0 => a vi(zry’) 
=> avi(zry) 
(2! V x) A (2! V y’) (B is distributive) 
LA (xz Vy’) 
(æv y’) 
y' < qr! 


{UYU YU 


zY 


We have a contradiction. The source of the contradiction is our supposition that 
rAy' = 0. Sox—y £0 as claimed. Let F be the B-filter {x — y}! generated by 
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x—y € B. Then ¥ extends to a B-ultrafilter F*. Then F* € fa(x — y). But 
x and y’ are both above x — y = x Ay’ and so must both belong to ¥*. Then 
F* € fa(x)N fay’). Then F* cannot belong to fp(y) (for if it did, y and y’ would 
both belong to ¥*). Then fp(x) 4 fe(y). We conclude that fg is one-to-one on B. 


Let (F (B), T(F(B))) be the topological space where 7(.7(B)) is the topology gen- 
erated by the open base {fg(a) : « € B}. A topological space is zero-dimensional if 
it has an open base of clopen sets. We have shown above that {fg(x):x € Bhisa 
set of clopen subsets of (B). So Z (B) is zero-dimensional. 


The topological space (.Y(B),7(.A(B))) is Hausdorff, Let Fı and Fz be distinct B- 
ultrafilters in (B). Then there exists some element x € F> which does not belong to 
F,. Then x’ must belong to Fı. It follows that Fı € fp(x) and Fz € f(x’). Since 
xa’ =0 implies fp(x)N fa(2’) = Ø it follows that .W(B) is a Hausdorff topological 
space. 


The topological space (.Y(B),7(.A(B))) is compact: To show that Z (B) is compact it 
suffices to show that a set {F;};e7 of closed subsets which satisfies the finite intersection 
property has non-empty intersection. Let # = {Fier be a collection of closed 
subsets of Z (B) which satisfies the finite intersection property. Then # is a filter 
base of closed sets. Let #* be the filter of closed subsets which is generated by the 
filter base #. Let 


T={xe€B:FC fp(az) for some F € #*} 


Since {fg(x) : x € B} is a base of clopen sets in .Y(B) it is a base for closed 
sets in (B) and so every closed set in (B) is the intersection of elements in 
{fa(z): a2 € B}. Then Q{F : F € #*} =r{fa(x): 2 € T}. Suppose x,y € T. 
We claim that x ^y € T: There exists F; and F; in #* such that F; C fg(x) and 
F; C fa(y). Then F; N F; C fa(z) fgly) = fglx ^y). Since F; N Fj must be 
non-empty and belong to #* then x Ay € T, as claimed. From this we deduce that 
T is a filter in B. Now T extends to the B-ultrafilter 7. Since x € J for all x € T 
then J €M{fa(z): ec E€ T} HN{F: Fe A*}. Then {F : Fe H} # ø. We 
conclude that .7(B) is compact, as required. 


Let A € B(F(B)), the Boolean algebra of all clopen sets in (B). Since A is 
open then A = U{fp(x) : x € K} for some K C B. Now A is also closed and 
so is compact in (B). The collection {fg(x) : « € K} is an open cover of A 
and so A has a finite cover. That is, there is a finite subset M C K such that 
A=U{fp(x): 2 € M} = fg(v(x € M)). We conclude that A € fs|B]. That is, fg 
maps B onto &(.Y(B)), as required. 
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A.4 Characterizations of Martin’s axiom. 


We can now present and prove a few characterizations of Martin’s axioms. 


Theorem 0.13 Let « be an infinite cardinal number such that x < 230, Then the following 
are equivalent: 


1) (Martin’s axiom MA) If (P, <) is a partially ordered set satisfying ccc and 2 = {Da : 
a < k} is a family of dense subsets of P, then there exists a filter F on P such that 
F N Da # Ø for each a < «r. 


2) If X is a compact Hausdorff topological space satisfying ccc and J = {Da : a < k} is 
a family of dense open subsets of X, then N{Da : a < K} # Ø. 


3) If (B,<,V,A, ’) isa Boolean algebra with the ccc property and J={Dyg:a<khisa 
family of dense subsets of B, then there exists a filter F on B such that F N Da # Ø 
for each a < k. 


4) If (P, <) is a partially ordered set satisfying ccc and |P| < k and 2 = {Da : a < k} is 
a family of dense subsets of P, then there exist a filter F on P such that F N Da 4 Ø 
for each a < kK. 


Proof: 
We are given that « be an infinite cardinal number such that « < 20, 


(1 = 2): We begin with the trivial case. Suppose X is finite. If X = {x 1, x2,...,@,} then 
every element of X is clopen and so the only dense subset of X is X. Hence the intersection 
of all dense subsets of X is X # Ø. We are done. 


Suppose now that X is an infinite set and that 7(X) denotes the set of all non-empty open 
subsets of X. Then (7(X),C) is a partially ordered set of subsets of X. Suppose X does 
not contain an uncountable family of pairwise disjoint open subsets of X. That is, suppose 
(T(X), C) satisfies the ccc. Suppose 2 = {Da : a < k} C 7(X) where Da is dense in X 
(ie., cl(Da) = X). For each a < «K, we define Ya = {U € 7(X) : cl(Ug) C Da}. (Since X 
is compact Hausdorff and none of D,’s are empty, none of the %,’s are empty.) 


We claim that, for each a, Va is a dense subset of the partially ordered set (T(X),C): 
Suppose M € 7r(X). It suffices to show that there is element of % which is a subset of M. 
See that, for any a < k, M N Da € T(X) and so there exists an element x and open set U 
such that x € U C cl(U) C MN Da C Da. Then U € Ya. We have shown that an element 
of Ya is a subset of M. So Ya is a dense subset of (7(X), C) as claimed. 


The set & = {Y%,: a < k} is then a family of dense subsets of (r(X), C) satisfying ccc. By 
Martin’s axiom, (7(X), C) contains a filter F such that F NYa # Ø for alla < «K. For each 
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a, choose Fy € FNM. Since Fa E€ Ya, then cl(Fa) C Da. Since F is a filter of non-empty 
open subsets of X which satisfies the finite intersection property, then {cl(Fy) : a < K} 
satisfies the finite intersection property inside compact X. Then there must be some a € X 
such that a € N{cl(Fy) :a< k} CN{Da:a<«}. Thus 0{Da : a < k} # Ø. This is what 
we were required to prove. 


(2 = 3): We are given that (B,<,V,A,’) is a Boolean algebra with the ccc property and 
2 = {Da : a < k} is a family of dense subsets of B.” We are required to find a B-filter, 
F, such that F N Da # Ø for alla < K. By the Stone representation theorem there exists 
a one-to-one isomorphism fg which maps B onto &(.7(B)), the set of all clopen subsets 
of the Stone space .7(B) shown to be compact and zero-dimensional. 


The Stone space F (B) satisfies the ccc: Suppose # = {Aa : a < u, Ag E T(-AY(B))} be 
a strong antichain of non-empty open subsets of Z (B) of cardinality u. We claim that 
u < No. For each a we can choose fp(aq) C Aa. Since the elements of & are pairwise 
disjoint and fg is one-to-one, then the set {fp(aa) : a < u} is of cardinality u. Since fp 
is an isomorphism then {aq : a < u} is an antichain in B of cardinality u. Since B is ccc 
then u < No. This establishes the claim. 


For each a < « choose dg E€ Da C B. If Ma = Uf{fB(da) : da € Da} we see that Ma is 
an open subset of Z (B) (since {fg(x) : x € B} is an open base for the topology 7(.7(B)) 
on .Y(B)). Suppose u is an ultrafilter in (B) — Ma and fg maps w € B to a basic open 
neighbourhood fg(w) of u. Then there exists d € Da such that 0 < d < w; this implies 
fa (d) C fg(w)N Ma. So every basic neighbourhood of u intersects Ma. Then Ma is dense 
in F (B). So {Ma :& < s} is collection of open dense subsets of the compact set Z (B). 


Our hypothesis guarantees that there exists z E€ N{M,:a< k}. Let Z, denote the set of 
all basic neighbourhoods of Z (B) which contain z. Now Z, satisfies the finite intersection 
property. Then the set F, = fp (Bz) is a B-filter. 


We claim that the subset F; in B is the B-filter we seek: Choose an arbitrary a, < K. Since 
z E€ Ma, =U{fa(da) : da € Da,} then z € fg(da) for some a. This implies da € F; N Doz. 
Then the B-filter, F, = f (Bz), intersects every dense set Da in B. This concludes the 
proof. 


(3 = 4): We are given that (P, <) is a partially ordered set such that | P| < x which satisfies 
ccc. Also suppose that the elements of J = {Da : a < k} are known to be dense subsets 
of P. We are required to show that, given our hypothesis, there exists a P-filter, #, such 
that £ N Da # Ø for all a < k. We will suppose, without loss of generality, that AP does 
not exist. (We can do this since, if we prove the existence of a P-filter, #, in P — {AP} 
then # is a P-filter in P.). If x € P we define “x!” as 


at ={yeP:y<-x} 


Consider the subset Zp = {xt : x € P}. Note that if a € a! N yl then at C x! N yt, hence 
Zp forms an open base for some topology T(P) on P. We have previously shown that 


“Recall that “Da is dense in B” means “if 0 < x € B there exists d € Da such 0 < d <r” 
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the set of all regular open subsets of P, (#o(P), C, V,N, P, Ø'^ forms a Boolean algebra in 
(T(P), €, U, N, P, Ø’) 


We claim that since P satisfies ccc then so does Zo(P) :® Let & be an antichain in #o(P) 
and A € &. Then A = intpclp(A). Since A is open in P then it is the union of 
base elements of the form gt. We can then choose a4! C A such that int pclp(a,!) C 
intpclp(A) = A. Then there is a well-defined one-to-one function h : & — P such that 
h(A) = a4 (Choice!). If A,B € # then AN B = Ø hence h(A) N g(B) = Ø. We conclude 
that h|] = {a4 E€ P: A€ &} is an antichain in P. Since P satisfies ccc then h|] must 
be countable. Since h is one-to-one, 2 must be countable. So #o(P) satisfies ccc, as claimed. 


We increase the size of the set 2 = {Da : a < K} of dense subsets of P, as follows. Let 
YP = {Da : a < K} U {Diy aye P} 


where each D(z) is a dense subset of P previously defined after definition 32.2.9 So 
every element of Y* is a dense subset of P. Since J = {Da : a < k} > |Z| < k and 
|P| < K > |U {Diy : x,y E€ P}| < r then |Y*| < x (by theorem 23.4 and 29.8). We define 
the function g : P > Zo(P) as g(x) = intpclp(x}). For œ < « and z, y € P we define 


Aa = g|Da] = {intpclp(2!) : £ € Da} 
A(z) = ID x,y)| = {int pel p(x") TE Dee ant 
Let A = {Ag : a < K}U{ Aay) : 2, y E P} C ZP). Let A be an arbitrary element in Z. 


We claim that A is dense in Zo(P). We first consider the case where A is of the form Ag. 
To see this, let H € r(P)— {Ø} such that H = intpclpH € £o(P). Then there exists t € P 
and a basic element tt € T(P) such that t € t! C H. Since Da is dense in P there exists 
d € Da such that d < t. So intpelp(d!) C intpclp(t!) C intpelpH = H. We have found 
an element intpclp(d!) in A such that int pclp(d!) C H. So A is dense in #o(P). If A is of 
the form A(z, y) the proof that A is dense proceeds identically. So ¥ intersects every set in &. 


By hypothesis, 4o(P) contains a filter, F, such that F N A # Ø for all Ac æ. Let 
g : P — o(P) be defined as above. That is, g(x) = int pclp(x!) and let 


KH =g |F] = {xE P: intpclp(x!) E€ F} 


The set # intersects all elements of 2*: Consider Dg € Y*. Then g[Dg] = Ag € X. 
There must exist u € F N Ag. Then g“ (u) € Dg N H. We now consider Diz) E€ 2* for 
some x, yin P. Then g[Diey)| = Avy) E Z. There must exist u E€ F N A(z y). Then 
g7 (u) E€ Day 1H. Then # meets all elements of 2*, as claimed. 


®The reader is left to verify the following facts: 1) If A C B then intxclx(A) C intxclx(B) 2) 
intxclx (A)N intxclx(B) = intxclx (A A B) 

The subset Dís) is previously defined in example 2 following the definition 32.2 as: 
Dey) = Via,y) U Wea,y) where Vizy) = {2 € P : z is not compatible with x or with y} and Wizy) = rt Nyt. 

10The reader is left to verify that, if A is a subset of X then intxclxA is regular open in X. 
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We claim that # is a proper P-filter: The set # is non-empty since F N A, #4 Ø. The 
set # does not contain a minimum element since P was assumed not to have one, hence 
KH + P. Suppose x € # and x < y. Then g(x) = intpclp(x!) € F and, since F is a 
So(P)-filter and g(x) C gly) = intpclp(y!), then g(y) C F, hence y € #. 

Let xz, y E€ #. We are now required to find q E€ # such that q < x and q < y. Since 
x, y E # then g(x), gly) E€ F. Since Day 1H # Ø there exists s E Diy) N #. 
Then g(s) € ¥. It is not possible that s! N zl = @, for if it was, we would have 
Ø = g(s! N xt) = g(s!) N g(x!) € F, a contradiction. Also it is impossible that s! N y! = Ø 
for this would not allow g(s) and g(y) to both belong to F. Then s € x! Nyl. Then s < x 
and s < y. We conclude that # is a P-filter. 


This concludes the proof of 3) = 4). 


(4 => 1): Suppose (P <) be a partially ordered set satisfying ccc. Let 2 = {Da : a < k} be 
a set of dense subsets of P. We will assume that P does not have a minimum element. Since 
Da is dense in P, for each x € P there exists q E€ Da such that q < x; hence £t N Da 4D 
for all x € P. For each a we define a function ga : P —> Da as 


Jo(x) = Za where £a € x! A Da (Choice!) 
We also define the function g : P x P — Pas 
g(x,y) =z where z € x! Nyl, g(x,y) arbitrary if rt Nyl AB (Choice!) 


Let To = {k} for some k € P. We inductively define the elements of {Tn : n < No} as 
follows: 
Tni = Tn U g[Tn X Th] U [U{ga|Tn]: @ < K} 


and let T = U{T, : n < No}. Since T C P, T inherits “<” from P to form a par- 
tially ordered set (T, <). Also note that since |{T, : n < No}| < Xo and |Th| < « then 
|T| =| U {Tn : n < Xo}| < No x k =k. Also see that go[Tn] C Tn4i C T for all n and so 
JalT] C T. Similarly, g[T x T] C T. 


Let A = {aa : a < A} be an antichain in T C P. Then aat N apt = Ø in T if a £ $. 
If p € P — T such that p € aal Nag! then glaa ag) = k € aat Nag!. Since k € T then 
aat N agl # Ø in T, a contradiction. Then aat N agl = Ø in P. We conclude that A is an 
antichain in P. Since P satisfies ccc then o cannot be uncountable. Therefore T satisfies 
CCC. 


We claim that, for any given a, TA Da is dense in T: Let t € T and a < k. Since Da is 
dense in P there exists z € Da such that z < t € T. Then galz) = za € 2! N Da. Then 
Za € T and za < z < t. Since za € TM Da we can conclude that T A Da is dense in T, as 
claimed. 


By hypothesis, there exists a T-filter, F, such that F N (T N Da) # Ø for alla < «K. If 
q € P let {q} = {x € P : x > q} be the principal P-filter generated by q. We let 
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F* = U{{q}! : q € F}. It easily verified that F* is a P-filter which contains F. Since 
F C F* and F intersects every Da then so does F*. Then F* is P-filter we were required 
to find. 


We finally have the following topological statement which is equivalent to Martin’s 
axiom. 


Theorem 0.14 Let x be a cardinal such that No < k < 280. Let X be a Hausdorff 
topological space satisfying ccc such that {x € X : æ has a compact neighbourhood} is 
dense in X. Suppose that 2 = {Da : a < k} is a family of dense open subsets of X. Then 
N{Da : a < k} is dense in X if and only if Martin’s axiom holds true. 


Proof: 


(= ) Let X be a compact Hausdorff space which satisfies ccc where X contains a family of 
dense open subsets of X, 2 = {Da : a < k}. Since X is compact every point x € X contains 
a compact neighbourhood. By hypothesis, N{Da : a < k} is dense in X, so N{Da:a < K} 
is not empty. Then by 2) & 1) in the previous theorem, Martin’s axiom holds true. 

( = ) Suppose Martin’s axiom holds true. Let X be a Hausdorff topological space satisfying 
ccc such that T = {x € X : x has a compact neighbourhood} is dense in X. Suppose that 
2 = {Da : a < k} is a family of dense open subsets of X (where Xo < «k < 20), We are 
required to show that N{Da : a < K} is dense in X. For any non-empty open subset U 
there exists a point x E€ U NT and some open neighbourhood S of x with compact closure, 
cly S, such that x € clx(SNU) C clxS. Since X satisfies ccc its compact subset cly (SNU) 
must also satisfy ccc. For any Da € 2, Da NUNS is open and dense in cly(S N U). By 
the topological equivalent form of MA, there exists q E€ N{DaNUNS:a < k}. Since 
MDaNUAS:a< Kk} CUN(MNDe: a < «K}) then q E N{Da:a< K} NU. Not 
only is O{Da : a < K} non-empty but it also intersects every open subset U of X. So 
{Da : a < k} is dense in X. 
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Appendix B : List of definitions and statements. 


I Axioms and classes : 1 / Classes, sets and axioms 
Axiom A1 (Axiom of extent) : For the classes z, A and B, [A = B] & |r € A x € B] 


Axiom A2 (Axiom of class construction): Let P(x) designate a statement about x which 
can be expressed entirely in terms of the symbols €, V, A, =, >,V, brackets and vari- 
ables z, y, z, ..., A, B, ... Then there exists a class C which consists of all the 
elements x which satisfy P(x). 


Axiom A3 (Axiom of pair) : If A and B are sets , then the doubleton {A, B} is a set. 


Axiom A4 (Axiom of subsets) : If S is a set and ¢ is a formula describing a particu- 
lar property, then the class of all sets in S which satisfy this property @ is a set. 
More succinctly, every subclass of a set of sets is a set. (Also called the Axiom of 
comprehension, Axiom of separation or Axiom of specification). 


Axiom A5 (Axiom of power set): If A is a set then the power set P(A) is a set. 
Axiom A6 (Axiom of union): If æ is a set of sets then Uccey C is a set. 


Axiom A7 (Axiom of replacement): Let A be a set. Let (x,y) be a formula which 
associates to each element x of A a set y in such a way that, whenever both ¢(z, y) 
and ¢(a, z) hold true, y = z. Then there exists a set B which contains all sets y such 
that (x,y) holds true for some x € A. 


Axiom A8 (Axiom of infinity): There exists a non-empty set A that satisfies the condition: 
“X € A” => “X U{X} e A”. (A set satisfying this condition is called a successor set 
or an inductive set.) 


Axiom A9 (Axiom of regularity) Every non-empty set A contains an element x whose 
intersection with A is empty. 


Axiom of choice : For every set & of non-empty sets there is a rule f which associates 
to every set A in æ% an element a € A. 


I Axioms and classes : 2 / Constructing classes and sets 
Theorem 2.1 For any class C, C=C. If it is not true that A = B we will write A £ B. 


Definition 2.2 If A and B are classes (sets) we define A C B to mean that every element 
of A is an element of B. That is, AC Biffe ce A=>xeBIf AC B we will say 
that A is a subclass (subset) of B. If AC Band A Æ B we will say that A is a proper 
subclass (proper subset) of B and write A C B when we explicitly want to say A 4 B. 
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Theorem 2.3 If C, D, and E are classes (sets) then: 
a) c= 0: 
b) C=D>D=C. 
c) C= D and D= E> C=D. 
d) CC Dand DCCs C=D. 
) 


e CC DandDCESCCE. 


Theorem 2.4 There exists a class which is not an element. 


Definition 2.5 The Axiom 2 states that C = {x : x # x} is a class. It contains no 
elements. We will call the class with no elements the empty class and denote it by Ø. 


Theorem 2.6 For any class C, Ø C C. 


Theorem 2.7 Let S be a set. Then: 


a) Ø C S and so Ø is a set. 
b) The set S is an element. Hence all sets are elements. 


Definition 2.8 If A is a set then we define the power set of A as being the class Y(A) of 
all subsets of A. It can be described as follows: A(A) = {X : X C A}. 


II Class operations 3 / Operations on classes and sets 


Definition 3.1 Let A and B be classes (sets). We define the union AU B of the class A 
and the class B as 
AUB={a:(x€ A) V (xe B)} 


That is, the element x € AUBiffx € Aoraz e€ bB. If æ is a non-empty class of 
classes then we define the union of all classes in & as 


U C = {x : x € C for some C € #} 
CEA 


That is, the element x € Uccy C iff there exists C € & such that x € C. 


Definition 3.2 Let A and B be classes (sets). We define the intersection A N B of the 
class A and the class B as 


ANB={zr:(xE€ A)A(xeE B)} 


That is, the element x € AN B iff x € A and z € B. If æ is a non-empty class of 
classes then we define the intersection of all classes in & as 

() C={x:x€C foral C € a} 

CEA 


That is, the element x € (\cc 4 C iff x € C for every class C in æ. 
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Definition 3.3 We will say that two classes (sets) C and D are disjoint if the two classes 
have no elements in common. That is, the classes C and D are disjoint if and only if 
CND=f: 


Definition 3.4 The complement, C’, of a class (set) C is the class of all elements which 
are not in C. That is, if C is a class, then 


CG ={x:x g C} 


Hence x € C' iff x ¢ C. Given two classes (sets) C and D, the difference C — D, of C 
and D, is the class 
C-D=CnD 


The symmetric difference, CAD, is the class 
CAD = (C— D)U(D-C) 


Theorem 3.5 Let C and D be classes (sets). Then, 
a) CC CUD 
b) CNDCC 
Theorem 3.6 Let C and D be classes (sets). Then, 
a) CU(CND)=C 
b) CN(CUD)=C 
Theorem 3.7 Let C be a class (a set). Then (C’)’ = C. 


Theorem 3.8 DeMorgan’s laws. Let C and D be classes (sets). Then, 
a) (CUD/=C'nND' 
by {Cn Dy =C Up! 
Theorem 3.9 Let C, D and E be classes (sets). Then, 
a) CU D = DUC and CN D = DNC (Commutative laws) 
b) CUC = C and CNC = C (ldempotent laws) 
c) CU(DUE)=(CUD)UE and CN (DN E) = (CN D)OE (Associative laws) 
d) CU(DNE) = (CUD)N (CUE) and CN (DUE) = (CN D)U (CN E) (Distribution) 


Theorem 3.10 Let A be a class (a set) and Y denote the class of all elements. 


a) YWUA=Y 
b) ANY=A 
c) Y'= 

d) gv =Y 

e) AUF =U% 
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Theorem 3.11 Let & be a non-empty class (set). 
/ 
a) (Ucew ©) = Noer C 
/ 
b) (New C) =Ucew Cc 
Theorem 3.12 Let D be a class and & be a non-empty class (set) of classes. 


a) DN (Ucew C) =Ucew(DNC) 
b) DU (Neew C) =Neew(DUC) 


Theorem 3.13 Let {Bu j) : i= 1,2,3,..., j = 1,2,3, ...} be aset of sets Then UZ (NFL Be,5)) = 
NP (UR Baz): 


II Class operations 4 / Cartesian products 


Definition 4.1 Let c and d be elements. We define the ordered pair (c,d) as (c,d) = 
{{c}, {e, dy}. 


Theorem 4.2 Let a,b,c and d be classes (which are elements). Then (a,b) = (c,d) iff 
a=cand b= d. 


Alternate definition 4.3 If cand d are classes define (c, d) as follows: (c, d) = { {c, Ø}, {d, {@}} }. 


Definition 4.4 Let C and D be two classes (sets). We define the Cartesian product, C x D, 
as follows: C x D = { (c,d): c € C and d E€ D}. 


Lemma 4.5 Let C and D be two classes (sets). Then the Cartesian product, C x D, of C 
and D satisfies the property: C x DC A(A(CUD)). 


Corollary 4.6 If C and D are classes, then the Cartesian product, C x D, is a class. If C 
and D are sets, then C x D is a set. 


Theorem 4.7 Let C, D, E and F be a classes. Then 


a) Cx (DN BE) =(C x D)N (Cx E) 
b) Cx (DUE) =(Cx D)U(Cx E) 
c) (CN E)x D=(Cx D)N(Ex D) 
d) (CUE)x D=(C x D)U (Ex D) 
e) (CUD) x (EUF) =(C x E)U(Dx E)U(Cx F)U(Dx F) 
f) (CON D)x (ENF)=(Cx E)N (Dx E)N (Cx F)N (Dx F) 


Theorem 4.8 If CC Dand EC F,thenC x ECD*xF. 


Theorem 4.9 Given three classes (sets) S, U and V there is a one-to-one correspondence 
between the two classes (sets) S x (U x V) and (S x U) x V. 
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Theorem 4.10 For classes c, d, e and f, if (c,d) = {{c, Ø}, {d, {@}}} and (e, f) = {{e, Ø}, 
{f,{G}}} then (c,d) = (e, f) if c =e and d= f. 


III Relations 5 / Relations on a class or set 


Definition 5.1 


a) We will call any subset R of ordered pairs in Y x YW a binary relation. 
b) We will say that R is binary relation on a class C if R is a subclass (subset) of 
C x C. In such cases we will simply say that R is a relation in C or on C. 


c) If A and B are classes (sets) and R is a subclass (subset) of A x B then R can 
be viewed as a relation on AU B. 


Definitions 5.2 Let C be a class (a set). a) The relation €c= {(z,y): 2 E€ C,y E C,xre€ y} 
is called the membership relation on C. b) The relation 


Idc = {(x,y) £E C, y€ C, x = y} 
is called the identity relation on C. 


Definitions 5.3 Let R be a relation on a class (set) C. The domain of R is the class, 
dom R = {x : x € C and (x,y) € R for some y € C}. The image of R is the class, 


im R = {y : y € Cand (z,y) € R for some x € C}. The word range of R is often 
used instead of “the image of R”. If R C A x B is viewed as a relation on AU B, then 
dom R C A andim R C B. 


Definition 5.4 Let C be a class (a set) and let R be a relation defined in C. The inverse, 
Rt, of the relation R is defined as follows: 


R = {(z,y): (y, £) € R} 


Definition 5.5 Let C be a class (a set) and let R and T be two relations in C. We define the 
relation ToR as follows: ToR = {(x, y) : there exists some z € im R such that (x, z) € 
R and (z,y) ET} 


III Relations 6 / Equivalence relations and order relations. 
Definition 6.1 Let S be a class and R be a relation on S. 
a) We say that R is a reflexive relation on S if, for every x € S, (a,x) € R. 


$ 
b) We say that R is a symmetric relation on S if, whenever (x,y) € R then (y, x) € R. 


c) We say that R is an anti-symmetric relation on S if, whenever (x,y) € R and (y, x) € R 
then x = y. 


d) We say that R is an asymmetric relation on S if, whenever (x,y) € R then (y, x£) £ R. 
e) We say that R is a transitive relation on S if, whenever (x,y) € R and (y, z) € R then 
(x,z) ER. 
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f) We say that R is an irreflexive relation on S if, for every x € S, (a,x) g R. 


g) We say that R satisfies the property of comparability on S if, for every x,y € S where 
xz #4 y, either (x,y) € Ror (y, x) € R. 


Definition 6.2 Let S be a class and R be a relation on S. We say that R is an equivalence 
relation on S if R is simultaneously 1) reflexive, 2) symmetric and 3) transitive on S. 


Definition 6.3 Let S be a class. 
a) Non-strict order relation. The relation R is a non-strict order relation on S if 
it is simultaneously reflexive (aRa holds true for any a in S), antisymmetric (if 
aRb and bRa then a = b) and transitive (aRb and bRc implies aRc) on S. A 
non-strict order relation, R, on S is said to be a non-strict linear order relation 
if, for every pair of elements a and b in S, either (a,b) € R, (b,a)€ Rora=b. 
That is, every pair of elements are comparable under R.'! A non-strict ordering, 


R, on S which is not linear is said to be a non-strict partial ordering relation on 
512 


b) Strict order relation. The relation R is a strict order relation if it is simultane- 
ously irreflexive ((a,a) ¢ R), asymmetric ((a, b) E€ R = (b,a) ¢ R) and transitive 
on S. If every pair of distinct elements, a and b, in S are comparable under a 
strict order relation, R, then R is a strict linear ordering on S. Those strict 
orderings which are not linear are called strict non-linear orderings or, more 
commonly, strict partial ordering relation. 

A non-strict partial order R on S always induces a strict partial order R* by 
defining aR*b = [aRb and a ¥ b]. Similarly, a strict partial order R on S always 
induces a non-strict partial order Rt by defining aRtb > [aRb or a = b]. 


Definition 6.4 Let S be a class and R be is a partial ordering or a strict ordering relation 
on S. If R is a partial ordering relation (a,b) € R is represented as a < b and if R is 
a strict ordering relation (a,b) € R is represented as a < b. Ifa < b and a Æ b, we 
will simply write a < b. 
a) A subset of S which is linearly ordered by R is called a chain in S. If R linearly 
orders S then S is a linearly ordered subset of itself and so is a chain. 


b) An element a of S is called a mazimal element of S if there does not exist an 
element b in S such that a < b. An element a of S is called a minimal element 
of S if there does not exist an element b in S such that b < a. 


c) An element m in S is called the minimum element of S if m < a (m < a) for 
alla € S. An element M in S is called the mazimum element of S if a < M 


11A class on which is defined a linear ordering R is also said to be fully ordered or totally ordered by R. 
In certain branches of mathematics “linearly ordered set” is abbreviated as l.o.set or simply called loset. 

127 certain branches of mathematics “partially ordered set” is abbreviated as p.o.set or simply called a 
poset 
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(a< M) for allae S. 


III Relations 7 / The partition of a set induced by an equivalence relation. 


Notation 7.1 Let R be an equivalence relation on a set S and let x € S. Then the set Sy 
is defined as follows: S+ = {y : (x,y) € R}. That is, S, is the set of all elements y in 
S such that y is related to x under R. 


Theorem 7.2 Let R be an equivalence relation on a set S. Let x and y be two elements 
in S which are not related under R. Then any element z in S which is related to x 
cannot be related to y. 


Theorem 7.3 Let R be an equivalence relation on a set S. Let x and y be two elements 
in S which are not related under R. Then Sg N Sy = Ø. 


Theorem 7.4 Let R be an equivalence relation on a set S. Let x and y be two elements 
in S which are related under R. Then S; = Sy. 


Theorem 7.5 Let R be an equivalence relation on a set S. For every x € S there exists 
some y € S such that x € Sy. 


Theorem 7.6 Let R be an equivalence relation on a set S. Then Upes Sz = S. 


III Relations 8 / On partitions and quotient sets of a set. 


Definition 8.1 Let S be a set. We say that a set € C P(S) forms a partition of S if € 
satisfies the 3 properties: 


2) IfAandBeE@andAABthn AN B = Ø. 
3) A#@ forall Ae @. 


Definition 8.2 Let S be a set on which an equivalence relation R is defined. 

a) Each element S, of Fr = {Sz : x E€ S} is called an equivalence class of x under R or 
an equivalence class induced by the relation R. 

b) The set Ye = {Sz : x E€ S} of all equivalence classes induced by the relation R is called 
the quotient set of S induced by R. The set Yr is more commonly represented by the 
symbol S/R. So S/R = {Sz : x E€ S}. From here on we will use the more common 
notation, S/R. 


Theorem 8.3 Let S be a set and @ be a partition of S. Let Rg be the relation such that 
(x,y) € Re iff {x,y} C S for some S in @. Then Rg is an equivalence relation on S. 


IV Functions 9 / Functions: A set-theoretic definition. 


412 List of definitions and statements. 


Definition 9.1 A function from A to B is a triple (f, A, B) satisfying the following prop- 
erties: 
1) A and B are classes and f C Ax B 
2) For every a € A there exists b € B such that (a,b) € f. 
3) If (a,b) € f and (a,c) € f then a = c. 


Definition 9.2 If f : A — B is a function and D C A then we say that the function 
f: D — Cis a restriction of f to D. In this case we will use the symbol f|p to 
represent the restriction of f to D. Note that, if D C A, then we can write flp C f 
since f|p = {(z,y): a € D and (x,y) E f} Cf. 


Theorem 9.3 Let f : A — B be a function and suppose A = C U D. Then f = f|-U flp. 


Theorem 9.4 Two functions f : A— B and g : A —> B are equal if and only if f(a) = g(x) 
for alla € A. 


Definitions 9.5 Let f : A — B be a function. 

a) We say that “f maps A onto B” if im f = B. We often use the expression “f : A —> B 
is surjective” instead of the words onto B. 

b) We say that “f maps A one-to-one into B” if whenever f(x) = f(y) then z = y. We 
often use the expression “f : A — B is injective” instead of the words one-to-one into 
B. 

c) If the function f : A — B is both one-to-one and onto B then we can simply say that 
f is “one-to-one and onto”. Another way of conveying this is to say that f is bijective, 
or f is a bijection. So “injective + surjective => bijective”. 

d) Two classes (or sets) A and B for which there exists some bijective function f : A > B 
are said to be in one-to-one correspondence. 


IV Functions 10 / Compositions of function. 


Definition 10.1 Suppose f : A — B and g : B — C are two functions such that the image 
of the function f is contained in the domain of the function g. Let h = {(z,z): y = 


f(x) and z = g(y) = g(f(x)) }. Thus (a, z) € h if and only if (x, z) = (x, g(f(z)). 
We will call h the composition of g and f, and denote it as go f. 


Theorem 10.2 Let f: A— Band g : B — C be two functions such that the image of 
the function f is contained in the domain of the function g. Then the composition of 
g and f, (go f): A—C , is a function. 


Theorem 10.3 Let f: A — B, g: B — C and h : C — D be three functions. Then 
ho (go f)=(hog)o f. 


Theorem 10.4 Let f: A — B. Then Igo f = f and fo Ia = f. 
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Definition 10.5 Let f : A— B. If g : B — A is a function satisfying g o f = I4 then we 
will call g an “inverse of f” and denote it as f~!. 


Theorem 10.6 Let f : A — B be a one-to-one onto function. 


a) An inverse function f~!: B — A of f exists. 

b) The function f~! is one-to-one and onto. 

c) The function f~! : B — A satisfies the property f o f~! = Ip. 
) 


d) The inverse function, f~', of f is unique. 


Definition 10.7 A function f : A — B which is one-to-one and onto is called an invertible 
function. 


Theorem 10.8 Let f : A — B and g : B — C be two onto-to-one and onto functions. 


a) The function go f is also one-to-one and onto. 
b) The inverse, go f, is (go f)"! = ft o g7! 


IV Functions 11 / Images and inverse images of sets. 


Definition 11.1 Let A and B be sets and suppose f : A — B is a function acting on A. If 
S is a subset of A = dom f we define the expression f[S] as follows: f[S] = {y : y = 
f(x) for some x € S}. We will say that f[S] is the image of the set S under f. 


Definitions 11.2 Let f : A — B be a function where A and B are sets. We define 
f€ : P(B) — YP(A) as fC (X) = VY iff Y = {y: ye A, f(y) € X}. In particular, 
fUr} = Y iff Y = {y : y € A, f(y) = x}. We will refer to it as the set-valued 
inverse function fT 


Theorem 11.3 Let f : A — B bea function. Then f~ : A(im f) —> A(A) is a one-to-one 
function on its domain Z (im f). 


Theorem 11.4 Let f : A — B be a function mapping the set A to the set B. Let £ bea 
set of subsets of A and & be a set of subsets of B. Let D C A and E C B. Then: 


a) f [sew S| = Usew f [S] 

b) f [Asew S] © Nsew f [S] with equality only if f is one-to-one. 

c) f [A — D] C B — f[D] with equality only if f is one-to-one and onto B. 
d) f7 (Useg S) = User F7 (8) 

e) fT Nsez S) = see fo (8) 

f) a (B-E)=A- f7 (E) 


IV Functions 12 / Equivalence relations defined by functions. 
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Definition 12.1 Let f : A — B be a function which maps a set A into a set B. We define 
an equivalence relation Ry on A as follows: Two elements a and b are related under Ry 
if and only if {a,b} C f(a) for some z in im f. The quotient set of A induced by Ry 
is then A/Rp = We, = {f7 ({x}) : x € f[A]} We will refer to Ry as the equivalence 
relation determined by f and A/Ry (or &p,) as the quotient set of A determined by f. 


Theorem 12.2 Let f : S — T be a function where S and T are sets. There exists an 
onto function g, : S — S/Ry and a one-to-one function h, : S/Rf — T such that 
h,og, =f. The function, h, og, = f, is called the canonical decomposition of f. 


V From sets to numbers 13 / The natural numbers. 
Definition 13.1 For any set x, we define the successor xt, of z as vt = x U {x}. 


Definition 13.2 If x is a set then x? = x U {æ}. A set A is called an inductive set if it 
satisfies the following two properties: 


a) BEA. 
b) re ASatead. 


Definition 13.3 We define the set of all natural numbers, N, as the intersection of all 
inductive sets. That is N = {x: x € I for any inductive set J}. 


Theorem 13.4 Let A be a subset of N. If A satisfies the two properties: 


a) OEA 
b) meASmteA 


then A=N. 


Corollary 13.5 (The Principle of mathematical induction.) Let P denote a particular set 
property. Suppose P(n) means “the property P is satisfied depending on the value of 
the natural number n”. Let 


A = {n € N: P(n) holds true } 


If A satisfies the two properties: 


a) 0 € A. That is P(0) holds true, 
b) (n € A) > (n* € A). That is, P(n) holds true > P(n*) holds true. 


then A =N. That is, P(n) holds true for all natural numbers n. 


Definition 13.6 A set S which satisfies the property “x € S > x C S” is called a transitive 
set. 


Appendix B 415 


Theorem 13.7 The non-empty set S is a transitive set if and only if the property “x € y 
and y E S” => “xe S”. 


Theorem 13.8 The set N of natural numbers is a transitive set. 


Theorem 13.9 a) For natural numbers n,m, m E€ n = m C n. Hence every natural 
number is a transitive set. 
b) For any natural number n, n Æ nt. 
c) For any natural number n, n g n. 


d) For any distinct natural numbers n,m, MCN SMEN. 


Theorem 13.10 Let m and n be distinct natural numbers. 


a) If m Cn then mt Cn. 


b) All natural numbers are comparable. Either m C n or n C m. Equivalently, 
meéenornem. Hence both “C” and “e” linearly order N. 


c) There is no natural number m such that n C m C nt. 


Theorem 13.11 Every natural number has an immediate predecessor. If k and n are 
natural numbers such that kt = n then k is called an immediate predecessor of n. For 


any non-zero natural number n, k = Umen ™ is an immediate predecessor of n. 


Theorem 3.12 Unique immediate predecessors. Any non-zero natural number has a unique 
immediate predecessor. 


Theorem 3.13 (The Principle of mathematical induction: second version.) Let P denote 
a particular property. Suppose P(n) means “the property P is satisfied depending on 
the value of the natural number n”. Let 


A = {n € N: P(n) holds true } 
Suppose that, for any natural number n, 
P(k) is true for all k < n => P(n) is true 


Then P(n) holds true for all natural numbers n. 


V From sets to numbers 14 / The natural numbers as a well-ordered set. 


Notation 14.1 We define the relation “E€_” on N as follows: 
m E- nif and only f m=normen 


If m €- n and we want to state explicitly that m #4 n we write m € n. 
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Theorem 14.2 Let (S,<) be a linearly ordered set. Suppose T C S. The element q is 
a least element of T with respect to “<” if q E T and q < m for all m € T. If S is 
equipped with a strict linear ordering “<” a least element of T with respect to < is 
an element q € T such that q < m for all m € T where m # q. The set (S,<) is said 
to be well-ordered with respect to “<” if every non-empty subset T of S contains its 
least element with respect to <. Similarly, the set (.S,<) is said to be well-ordered 
with respect to “<” if every non-empty subset T of S contains its least element with 
respect to <. 


Theorem 14.3 The natural numbers N is a strict €-well-ordered set. 
Corollary 14.4 Every natural numbers n is a €-well-ordered set. 
Theorem 14.5 Any bounded non-empty subset of (N, €) has a maximal element. 


Definition 14.6 Consider the set {1,2} of all functions mapping natural numbers to 1 or 
2. We define the lexicographic order “<” on {1,2} as follows: For any two elements 
f = {ao, a1, a2,43,...,} and g = {bo, b1, b2, b3,..., } in {1,2}, f = g if and only if 
a; = 6; for alli € N and f < g if and only if for the first two unequal corresponding 
terms a; and b;, a; € bi. A lexicographic ordering can similarly be defined on SN 
where S$ is any subset of N. 


V From sets to numbers 15 / Arithmetic of the natural numbers. 


Definition 15.1 Let m be a fixed natural number. Addition of a natural number n with 
m is defined as the function rm : N —> N satisfying the two conditions 


Tm O) = m 


Tm(n*) = [rm(n)]* 
The expression m + n as simply another way of writing rm(n). Thus 
Tm(0) =m & m+0=m (5) 


Tmn”) = [rm] & m+tnt=(mtn)t (6) 


Theorem 15.2 Let m be a fixed natural number and let rm : N — N be a function 
satisfying the two properties 


mnm 
3 
3 
e 
| 
3 


Then rm is a well-defined function on N. 
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Definition 15.3 For any natural number m, multiplication with the natural number m is 
defined as the function sm : N —> N satisfying the two conditions 


Sm(0) = 0 


Sm(n™) = sm(n)+m 


We define the expression mn and m x n as alternate ways of writing sm(n). Thus 








Sm(0)=0 m=mix0=—0 (7) 


Sm(n*) = smn) +m & mnt =mnt+mamxntm (8) 


Theorem 15.4 Let m be a fixed natural number and let sm : N — N be a function 
satisfying the two properties 


oe z 0 


Sm(n Sm(n) +m 
Then sm is a well-defined function on N. 


Theorem 15.5 For any two natural numbers m and n, m E€- n if and only if there exists 
a unique natural number k such that n = m + k. 


Definition 15.6 For any two natural numbers m and n such that m < n, the unique nat- 
ural number k satisfying n = m + k is called the difference between n and m and is 
denoted by n — m. The operation “—” is called subtraction. 


V From sets to numbers 16 / The integers Z and the rationals Q. 


Theorem 16.1 Let Z = Nx N. Let R, be a relation on Z that is defined as follows: 
(a, b)R.(c, d) if and only if a +d = b+ c. Then R, is an equivalence relation on Z. 


Corollary 16.2 Let Z = N x N be equipped with the equivalence relation R, defined as: 
(a,b) R,(c,d)Sa+d=b+ec 


For each n € N let [(0,n)] and [(n,0)] denote the R,-equivalence classes containing 
the elements (0,7) and (n,0) respectively. Then the quotient set induced by R, can 


be expressed as ZR, = {[(0,n)] : n € N} U {[(n, 0)] : n € N} 
Definitions 16.3 The set of integers, Z, is defined as: 


Z= Z/R, = {[(a,b)] : a,b € N} = {[(0, n)] : n € N} U {[(n,0)] : n € N} 
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a) Negative integers: The set of negative integers is defined as being the set 
Z = {[0,n)]:n € N} 
Positive integers: The set of positive integers is defined as being the set 
Zt = {[(n,0)]:n E N} 


The elements of the form |(0,n)] can be represented by —n = [(0,n)] while the 
elements of the form |(n, 0)] can be represented as n = [(n, 0)]. 


b) Order relation on Z: We define a relation <, on Z as follows: [(a,b)] <- [(c, d)] 
if and only ifa +d < b+c. It is a routine exercise to show that <, is a linear 
ordering of Z. 


c) Addition on Z: We must sometimes distinguish between addition of natural 
numbers and addition of integers. Where there is a risk of confusion we will 
use the following notation: “+n” means addition of natural numbers while “+,” 
means addition of integers. 

Addition +, on Z is defined as: 


[(a, b)] +2 [(c, d)] = [(a +n c, b +n d)] 
d) Opposites of integers: The opposite —[(a, b)] of [(a, b)] is defined as 
—[(a, b)] = [(b, a)]' 


e ” 


e) Subtraction on integers: Subtraction “—,” on Z is defined as: 
[(a, b)] -z [(c, d)] = [(a, b)] + ([(e, @)])? 

f) Multiplication of integers: Multiplication x, on Z is defined as 
[(a, b)] xz [(e, d)] = [(ac + bd, ad + be)].3 


In particular, [(0, n)] xz [(m, 0)] = [(0+0,0+nm)] = [(0,nm)] = —[(nm, 0)] and 
[(n, 0)] xz [(m, 0)] = [(nm, 0)]. 

Absolute value of an integer: The absolute value, |n|, of an integer n is defined 
as 


ee} 
Na” 


ae n if O<,n 
~ l-n if n<,0 








‘Note that —[(n,0)] = [(0,7)] n. 

?When there is no risk of confusion with subtraction of other types of numbers we will simply use 

3Note that the “center dot” can be used instead of the “x! symbol. When there is no risk of confusion 
with multiplication of other types of numbers we will simply use “x”. 


“n 
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h) Equality of two integers: If (a,b) and (c, d) are ordered pairs which are equivalent 
under the relation Rz, then the R,-equivalence classes |(a, b)| and [(c, d)] are equal 
sets. To emphasize that they are equal sets under the relation R, we can write 


[(a, b)] =z [(c, d)] 
i) Distribution properties: If [(a,b)], [(c, d)] and [(e, f)] are integers then 
[(a, b)] xz ([(c, d)] +2 [(e, f)]) =z [(a, b)] xz [(c, d)] +z [(a, b)] xz [le f)] 
and 
(Ie, d)] +2 [(e, f)]) xz [(a, b)] =z [(c, d)] xz [(a, b)] +2 [(e, f)] xz [(a, b)] 


Theorem 16.4 Let Q = Z x Z* where Z* = Z — {0}. Let R, be a relation on Q defined as 
follows: (a,b) R,(c, d) if and only ifax,d=bx,c. Then R, is an equivalence relation 


on Q. 
Definitions 16.5 The set of rational numbers, Q, is defined as: 
Q = Q/R; = {[(a,b)] : a € Z, be Z*} 
The expression [(a, b)] is normally written in the form $. 


a) We define a relation <q on Q as follows: If b and d are both positive, [(a, b)] <4 
[(c, d)] if and only if a x; d <; b Xz c. 


b) Addition +4 on Q is defined as: 
[(a, b)] +; [(c, d)] = [(ad +z bc, b xz d)| 
c) Subtraction —, on Q is defined as: 
[(a, b)] —4 [(c, d)] = [(a, b)] +4 [(-e, d)] 
d) Multiplication x4 on Q is defined as 
[(a, b)] xq [(c, d)] = [(a xz c, b xz d)] 


e) Equality of two rational numbers: If (a,b) and (c, d) are ordered pairs of integers 
(b, d € 0) which are equivalent under the relation Rq, then the R,-equivalence 
classes [(a, b)] and [(c, d)] are equal sets. To emphasize that they are equal sets 
under the relation Rg we can write 


[(a, b)] =¢ [le d)] 


‘Recall that a and b is shorthand for expressions of the form [(0,a)] or —[(0,b)] 
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f) Opposites of rational numbers. If (a,b) is an ordered pair of integers (b # 0) and 
[(a, b)] is its Rg-equivalence class then the opposite of the rational number [(a, b)] 
is defined as [(—a, b)] and is denoted as —|(a, b)] =g [(—a, b)]. 


Theorem 16.6 Suppose a and b are positive integers where b # 0 and [(a,b)] is an Ry 
equivalence class. Then 


V From sets to numbers 17 / Dedekind cuts: “Real numbers are us!” 


Definition 17.1 For any real number r, let pS; denote the interval (—oo,r) in R. It is 
the subset of all real numbers strictly smaller than r. The subset pS, is called an 
initial segment in R. For any real number r, the subset QSr = (—0o0o, r)NQ= rRS-NQ 
is called an initial segment in Q. For each of pS; and QS; the real number r is the 
least upper bound of (—oo,r) and (—œ,r) N Q respectively. Note that r may be a 
non-rational number even for gS; a proper subset of Q. 


Definition 17.2 The elements gS, = (—oo, r) 1 Q of the set J = {QSr : r € R} are called 
Dedekind cuts 


Definition 17.3 The set of all Dedekind cuts J, linearly ordered by inclusion with addition 
+ and multiplication x as described above is called the real numbers. Those Dedekind 
cuts which do not have a least upper bound in Q are called irrational numbers. 


Lemma 17.4 The union of a set of Dedekind cuts is either Q or a Dedekind cut. 


Theorem 17.5 Every non-empty subset of R which has an upper bound has a least upper 
bound. 


VI Infinite sets 18 / Infinite sets versus finite sets. 


Definition 18.1 A set S is said to be an infinite set if S has a proper subset X such that 
a function f : S — X maps S one-to-one onto X. If a set S is not infinite then we 
will say that it is a finite set. 


Theorem 18.2 Basic properties of infinite and finite sets. 


The empty set is a finite set. 

Any singleton set is a finite set. 

Any set which has a subset which is infinite must itself be infinite. 
Any subset of a finite set must be finite. 


Appendix B 421 

Theorem 18.3 Let f : X — Y bea one-to-one function mapping X onto Y. The set Y is 
infinite if and only if the set X is infinite. 

Corollary 18.4 The one-to-one image of a finite set is finite. 

Lemma 18.5 Let S be an infinite set and x € S. Then S — {x} is an infinite set. 

Theorem 18.6 Every natural number n is a finite set. 


Corollary 18.7 [AC] A set S is finite if and only if S is empty or in one-to-one correspon- 
dence with some natural number n. 


Theorem 18.8 The recursively constructed function theorem. Let S be a set, k: P(S) > 
S be a well-defined function on A(S) and f C Nx S be a relation. We write f(n) =a 
if and only if (n,a) € f. Let m € S. Suppose the relation f satisfies the two properties 


{ f(0)=m = (0,m) = (0, f(0)) € f 
(n, f(n Ef => (n+1,k(S—{f(0), FQ)... f= (+1, fln+1)) Ef 


Then f is a well-defined function on N. 


Theorem 18.9 [AC] A set S is an infinite set if and only if it contains a one-to-one image 
of the N. 


Theorem 18.10 If the set S is a finite set and f : S — X is a function, then f[S] is finite. 


Theorem 18.11 If a set S contains n elements then Z (S) contains 2” elements. Hence, 
if a set S is a finite set, then the set Z (S) is finite. 


VI Infinite sets 19 / Countable and uncountable sets. 


Definition 19.1 Two sets A and B are said to be equipotent sets if there exists a one-to- 
one function f : A — B mapping one onto the other. If A and B are equipotent we 
will say that “A is equipotent to B” or “A is equipotent with B”. 


Definition 19.2 Countable sets are those sets that are either finite or equipotent to N. All 
infinite sets which are not countable are called uncountable sets. 


Theorem 19.3 A subset of a countable set is countable. 
Theorem 19.4 Any finite product, N x N x --- x N, of N is a countable set. 


Lemma 19.5 Suppose f maps an infinite countable set A onto a set B = f[A]. Then B is 
countable. 


Theorem 19.6 Let {A; : 71 € S C N} be a countable set of non-empty countable sets Aj. 


Then U,<g5 Ai is countable. 
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Theorem 19.7 The set of all real numbers R is uncountable. 


VI Infinite sets 20 / Properties of the equipotence relation. 


Theorem 20.1 Let Y be a class of sets. The equipotence relation Re on Z is an equiva- 
lence relation on S. 


Theorem 20.2 Suppose A, B, C and D are sets such that A ~e B and C ~e D where 
ANC = Ø= BND. Then (AU C) ~e (BUD). 


Theorem 20.3 Suppose A, B, C and D are sets such that A ~e B and C ~e D. Then 
AxC ~weBxD. 


Corollary 20.4 Suppose A and B are infinite sets. 
1) If {A, B} c [N]. then A x Be [N]e. 
2) If {A, B} c [R]e then A x Be [R]e. 


Theorem 20.5 If {A; : i = 0,1, 2,...,n} is aset of n non-empty countable sets then Į [i-o Ai 
is countable for all n. 


Theorem 20.6 Suppose S is and infinite set and T is a countable set such that SAT = Ø. 
Then S ~e SUT. 


Theorem 20.7 If the sets A and B are equipotent, then so are their associated power sets 


P(A) and P(B). 


Theorem 20.8 Any non-empty set S is embedded in its power set Y(S). But no subset 
of S is equipotent with Y(S). 


Definition 20.9 We will say that the non-empty set A is properly embedded in the set B 
if A is equipotent to a proper subset of B but B is not equipotent to A or any of its 
subsets. To represent the relationship “A is properly embedded in B” we will write 


A =e B 


If A and B are sets such that A is equipotent to a subset of B (where B may, or 
may not, be embedded in A), we will say that A is embedded in B. To represent the 
relationship “A is embedded in B” we will write 


Aen B 


Definition 20.10 Let Z = {9 : S is a set} and & = {[S]je: SE F }. Let [A]e and [B]e 
be elements of &. We write 
[Ale <e [Ble 
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if and only if A, B. We write 
[Ale <e [B]e 
if and only if A>. B. 


Proposition 20.11 Let S be any set. Suppose #°(S) = S, A'(S) = P(S) and A"(S) = 
P(P"(S)) for all n > 1. The set 


{[Sle,[P(S)le [F7(Sles [Pe o [F"(S)le- ++} 


forms an infinite <,-ordered chain of distinct classes in &. 
Theorem 20.12 For every any non-empty set S, 25 ={yr:T € A(S)} ~e P(S) 
Theorem 20.13 The set R is embedded in A(N). 


Theorem 20.14 The set Y(N) is embedded in R. 


VI Infinite sets 21 / The Shréder-Bernstein theorem. 


Theorem 21.1 (The Schréder-Bernstein theorem) If S and T are infinite subsets where S 
is embedded in T and T is embedded in S then S and T are equipotent. 


Lemma 21.2 Let T be a proper subset of the set S and f : S — T be a one-to-one function 
mapping S into T. Then there exists a one-to-one function h : S — T mapping S 
onto T. 


Theorem 21.3 The set R of all real numbers is equipotent to A(N). 
Theorem 21.4 The sets NN and R are equipotent. 


Definition 21.5 If A and B are two sets, then the symbol B4 refers to the set of all func- 
tions mapping A into B. 


VII Cardinal numbers 22 / An introduction to cardinal numbers. 
Postulate 22.1 There exists a class of sets @ which satisfies the following properties: 


1. Every natural number n is an element of @. 
2. Any set S € Z is equipotent with precisely one element in @ 
The sets in @ are called cardinal numbers. When we say that a set S has cardinality 


k we mean that k € @ and that S ~e K. If the set S has cardinality k, we will write 
[S| = k. 
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Definition 23.2 If S and T are sets and K =|S| and A = |T| then we define addition“+” , 


Oy 


multiplication “x” and exponentiation of two cardinal numbers as follows: 


a) If SNT = Ø, 


K+A=|SUT| 
b) 
Rx A= [ST 
c) 
kò = | ST] 





where ST represents the set of all functions mapping T into S (as previously 
defined). That is, |S|!7! = |ST|. For convenience we define 0* = 0 and k? = 1. 


Theorem 22.3 The class @ of all cardinal numbers is a proper class. 


VII Cardinal numbers 23 / Arithmetic of cardinal numbers. 


Theorem 23.1 Addition on @ is well-defined. That is, if S1, S2, Ti and To are sets such 
that s = |S | = |So| and A = [Ti] = [To], then |S UTi =K+A=|S.UT)I. 
Theorem 23.2 Let x, à, 6 and w be any four cardinal numbers. Then 
a) K+A=A+K  (Commutativity of addition) 
b) (K+A) + G=K4 (A+@)  (Associativity of addition) 
c) KIKA 
d) k<ando<y=>rk+o<++y. 


Theorem 23.3 Multiplication on @ is well-defined. That is, if S1, S2, Ti and T are sets 
such that « = |S | = |S2| and A = |Ti] = |T9|, then |S1 x Ty] = K x A = | S2 x Tol. 


Theorem 23.4 Let x, 4, 6 and w be any three cardinal numbers. Then 


a 


b 


) KX AX=XX K  (Commutativity of multiplication) 
) (Kx A) xX 6=KX (AX P) (Associativity of multiplication) 
c) KX (A+) =(K XA) + (KX p) (Left-hand distributivity) 
d) A>O0SK<(KxA) 
) 
) 
) 


e) K<Aand@<Y~SRxG@<aAxv. 


f) k+K=2xXk. 


g) K+K <4 Xk when k > 2. 
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VII Cardinal numbers 24 / Exponentiation of cardinal numbers. 


Theorem 24.1 Exponentiation on @ is well-defined. That is, if S, S*, T and T* are sets 
such that |S| = |S*| and |T| = |T*|, then |S7| = |$*7" 





Theorem 24.2 Let k, A and ¢ be any three cardinal numbers. Then 


a) KTI = kò x KP 
b) (wm)? = Ki? 
c) (Kx A)? = KP x dP. 


Theorem 24.3 Let x, A, and qa be infinite cardinal numbers. Then 
a) k < kò 
b) a<k=> aà < kò 


c) a <A> ROR 


VII Cardinal numbers 25 / Sets of cardinality c 


Theorem 25.1 Let C denote the set of all complex numbers and J denote the set of all 
irrational numbers. Let n denote the cardinality of a non-empty finite set. 
a) The cardinality of R” is c. 
b) The cardinality of C is c. 
c) The cardinality of J is c. 


Theorem 25.2 


a) Let Zg denote the set of all countably infinite sequences of real numbers. The 
cardinality of Ap is c. 


b) Let Ay denote the set of all countably infinite sequences of natural numbers. The 
cardinality of Ay is c. 


c) Let Nù- denote the set of all one-to-one functions mapping N to N. The cardi- 
nality of Nù -1 is c. 
d) Let Ri-1) denote the set of all one-to-one functions mapping N to R. Then the 


cardinality of Ri-1) is c. 


Proposition 25.3 The Cantor set has cardinality c. 


VIII Ordinal numbers 26 / Well-ordered sets. 
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Theorem 26.1 Let f : T — S be a one-to-one function mapping T onto S. If T isa 
well-ordered then T induces a well-ordering on S. In particular, every countable set 
can be well-ordered. 


Definition 26.2 Given a well-ordered set (S, <), a proper subset U satisfying the property 
[u € U and z < u] = [x € U] 


is called an initial segment of S. In this definition the strict order relation < can be 
used instead of < without altering the meaning of “initial segment”. 


Theorem 26.3 If (S,<) is a well-ordered set then every initial segment in S is of the form 
Sa = {£E S: x <a} for someae S. 


Definition 26.4 Let f : (S,<,) — (T,<,) be a function mapping a well-ordered class, 
(S,<,), onto a well-ordered class, (T,<,). Note that the symbols <, and <,, will 
allow us to distinguish between the order relations applied to each set S and T. 


a) We will say that the function f is increasing on (S,<,) if 
(z <s y) => (f(z) <r fy) 
b) We will say that the function f is strictly increasing on (S,<,) if 


(£ <s y) = (fæ) <r fy) 


A strictly increasing function must be one-to-one. 
c) If f : (S,<,) — (T, <) is strictly increasing then f is said to be an order 


isomorphism mapping S$ into T. 


If there exists an onto order isomorphism between the two well-ordered classes, (S, <) 
and (T, <,.), we will say that the classes are order isomorphic, or that a function maps 
S order isomorphically onto T. 


If there exists an onto order isomorphism between the two well-ordered classes (S, <,) 
and (T, <,) we will say that the classes are order isomorphic or that a function maps 
S order isomorphically onto T. 


Theorem 26.5 Let (S, <s) and (T, <r) be a well-ordered sets. 


a) The inverse of an order isomorphism is an order isomorphism. 


b) If f : (S,<s) — (S, <s) is a strictly increasing function mapping S into itself 
then f(x) > x for alla € S. 


c) The set S cannot be order isomorphic to an initial segment of itself. 


d) If f: (S, <s) — (S, <s) is an order isomorphism then f is the identity function. 
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e) If f : (S,<s) —> (T,<r) and g : (S,<s) — (T,<r) are order isomorphisms 
mapping S onto T then f =g. 


f) Suppose f : (S,<s) — (T,<r) is an order isomorphism mapping S onto an 
initial segment of T. Then S and T cannot be order isomorphic. 


Notation 26.6 Let S and T be two well-ordered sets. Then the expression 
S wwo T 
means “S and T are order isomorphic”. The expression 
Sano Lf 
means “S ~wo Ta” where Ta is some initial segment of T. The expression 
S <w T 
means “S ~wo T or S <wo T”. 


Theorem 26.7 Let (5,<,) and (T, <„) be two well-ordered sets. Then either S <wo T 
or T <wo S. 


Proposition 26.8 For every natural number n, the lexicographically ordered set S = 
{1,2,...,2} x N is well-ordered. 


VIII Ordinal numbers 27 / Ordinal numbers: Definition and properties. 


Definition 27.1 Let S be a set. If S satisfies the two properties, 


1) Sa transitive set, 
2) S is strictly €-well-ordered 


then S is called an ordinal number. 


Notation 27.2 When viewed as an ordinal number, N will be represented by the lower-case 
Greek letter w. 


Theorem 27.3 If a is an ordinal number then so is its successor at = a U {a}. 


Definition 27.4 Suppose the set S is <-ordered. We say that an element y in S is an 
immediate successor of the element x if x < y and there does not exist any element z 
in S such that x < z < y. We say that x is an immediate predecessor of y if y is an 
immediate successor of x. 


Theorem 27.5 Let a be an ordinal number greater than zero. 


a) Every element x of the ordinal a is an initial segment of a. 
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b) The ordinal a is an initial segment of some ordinal. 
c) Every initial segment x in a is an ordinal number. 


d) Every element of the ordinal a is an ordinal number. 


Proposition 27.6 Any infinite ordinal not equal to w contains w. 

Proposition 27.7 Let a and @ be distinct ordinal numbers. If a C 8, then a € 2. 
Lemma 27.8 If the ordinals a and 8 are order isomorphic, then a = £. 
Theorem 27.9 The relation “€” linearly orders the class of all ordinals. 


Definition 27.10 An ordinal a which does not contain a maximal element is called a limit 
ordinal. 


Proposition 27.11 If U is a non-empty set of ordinals which contains a maximal element 
B with respect to €, then the union, U{a : a € U}, is equal to the maximal ordinal, 
B, of U. 


Theorem 27.12 If U is a set of ordinals which does not contain a maximal element with 
respect to “€”, then y = U{a: a € U} is a limit ordinal which is not contained in U. 
Furthermore, y is the €-least ordinal which contains all elements of U. 


Corollary 27.13 Let U be a non-empty set of ordinals which contains no maximal element. 
If U satisfies the “initial segment property”, then U is the limit ordinal U{a: a € U}. 


Definition 27.14 Let T be a non-empty subset of an ordered set (S,<). If u is an upper 
bound of the set T and, for any other upper bound v of T, u < v, then we say that u 
is the least upper bound of T. We also abbreviate the expression by writing u = lub T 
or u = lub(T). 


Theorem 27.15 Let y be a non-zero ordinal number. The following are equivalent: 


1) The ordinal y is a limit ordinal. 
2) The ordinal y is such that y = U{a : a € y}. 
3) The ordinal y is such that lub(y) = y. 


VIII Ordinal numbers 28 / Properties of the class of ordinal numbers. 

Theorem 28.1 The class, @, of ordinal numbers is a strict €-linearly ordered class. 
Theorem 28.2 The class @ of all ordinal numbers is €-well-ordered. 

Theorem 28.3 A set S is an initial segment of @ if and only if S is an ordinal number. 


Theorem 28.4 The class @ of all ordinal numbers is not a set. 
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Theorem 28.5 Principle of transfinite induction. Let {rq : a E€ @} be a class whose 
elements are indexed by the ordinals. Let P denote a particular element property. 
Suppose P(a) means “the element £a satisfies the property P”. Suppose that, for 
any BE CO, 

“P(qa) is true V a € 8” implies “P(8) is true” 


Then P(qa) holds true for all ordinals a € @. 


Corollary 28.6 Transfinite induction. Version 2. Let {£a : a E€ Ê} be a class whose 
elements are indexed by the ordinals. Let P denote a particular element property. 
Suppose P(a) means “the element £a satisfies the property P”. Suppose that: 


1) P(0) holds true, 
2) P(a) holds true implies P(a + 1) holds true, 
3) If 3 is a limit ordinal, “P(q) is true for all a € 8 implies P(8) is true”. 


Then P(a) holds true for all ordinals a. 


Theorem 28.7 Let S be a <-well-ordered set. Then S is order isomorphic to some ordinal 
number a € Ê. Furthermore this order isomorphism is unique. 


Definition 28.8 Let S be a <-well-ordered set. If œ is the unique ordinal which is order 
isomorphic to S then we will say that S is of order type a, or of ordinality a. of S is 
a. If S is of ordinality a, we will write °¢S = a. 


Lemma 28.9 Hartogs’ lemma. Let S be any set. Then there exists an ordinal œ which is 
not equipotent with S or any of its subsets. 


Theorem 28.10 There exists an uncountable ordinal. 


Corollary 28.11 The class wı = {a € @: a is a countable ordinal } is the least uncount- 
able ordinal. 


Definition 28.12 Let S be any set. Let 
U = {a € Ê: a not equipotent to any subset of S} 


By Hartogs’ lemma the class U is non-empty. Since @ is €-well-ordered, U contains 
a unique least ordinal h(S). We will call the ordinal A(S) the Hartogs number of S. 
Then h can be viewed as a class function which associates each set S in the class of 
all sets to a unique ordinal number a in the class of all ordinals @. 


Theorem 28.13 There exists a strictly increasing class {wa : a € @} of pairwise non- 
equipotent infinite ordinals all of which are uncountable except for wo = w. 


Theorem 28.14 Let {wa : a E€ @} be the class of ordinals as defined in the previous 
theorem. 
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a) Every element of {wa : a € G} is a limit ordinal. 
b) For every ordinal a, either a € wa or a = wa. 
Theorem 28.15 The Transfinite recursion theorem. Let W be a well-ordered class and 


f:W — W be a class function mapping W into W. Let u € W. Then there exists a 
unique class function g : © — W which satisfies the following properties: 


a) g(0) =u 
b) gla*) = f(g(a)), Vae @ 
c) g(B) = lub{g(a): a € P}, V limit ordinals 8 


VIII Ordinal numbers 29 / Initial ordinals: “Cardinal numbers are us!” 


Definition 29.1 We say that ( is an initial ordinal if it is the least of all ordinals equipotent 
with itself. That is, 8 is an initial ordinal if a E€ B = a £e b. 


Lemma 29.2 The class of all initial ordinals is a subclass of J. 
Theorem 29.3 The class -¥ is precisely the class of all initial ordinals. 
Theorem 29.4 [AC] The Well-ordering theorem. Every set can be well-ordered. 
Theorem 29.5 The class of all initial ordinals % = wo U {wa : a E€ Ê} satisfies the follow- 
ing properties : 
1. Every set S is equipotent to exactly one element in .%. 


2. Two sets S and T are equipotent if and only if they are equipotent to the same 
element of J. 


3. The class J is €-linearly ordered. 


Definition 29.6 Cardinal numbers. An ordinal is called a cardinal number if and only if 
this ordinal is an initial ordinal. The class, J, is also referred to as the class, @, of 
all cardinal numbers. 


Lemma 29.7 For any infinite cardinal number x, define a relation <, on «K x «K as follows. 
For pairs (a, 3) and (y, Y) of ordinal pairs in k x k, 


auUBeyUw 
or 


(a, B) <x (7, ¥) BewwhenaUB=yUw 
or 
aéeéywhenaVUG=yUy and B=y 


Then <, well-orders k x kK. 
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Theorem 29.8 [AC] For any ordinal a, Na X Na = Na. 


Corollary 29.9 Let « be an infinite cardinal and {Aa : a € 3} be a set of non-empty sets 
indexed by the elements of the ordinal 8 €= «k where |Aq| E= « for all a € 8. Then 
|U {Aq :a@€ B}| E= Kk. 


Corollary 29.10 For any infinite cardinal number k, kf = 2*. 


Definition 29.11 


a) We say that an infinite cardinal number, N4, is a successor cardinal if the index, 
y, has an immediate predecessor (i.e., y = 3+ 1, for some 3). The expression, 
Na+ = Na41, denotes a successor cardinal. We say that an infinite cardinal num- 
ber, Ny, is a limit cardinal if y is a limit ordinal (i.e., y = lub{a: a € y}). 


b) We say that a limit cardinal N4 is a strong limit cardinal if Ñ} is uncountable and 
{2 a eq} Cry. 


Theorem 29.12 [GCH] Every uncountable limit cardinal is a strong limit cardinal. 


Theorem 29.13 [AC] There exists a strong limit cardinal number. 


Definition 29.14 


a) We say that an infinite cardinal X4 is a singular cardinal number if R} is the least 
upper bound of a strictly increasing sequence of ordinals, {a, : x E€ B}, indexed 
by the elements of some ordinal ( in ÑN}. 


b) An infinite cardinal §, is said to be a regular cardinal number if it is not a singular 
cardinal number. That is, there does not exist an ordinal, 3, in N} such that 
Ny = lub{dj,t K E€ B}. 


Theorem 29.15 Every infinite successor cardinal, Na+ = Na41, is a regular cardinal. 


Theorem 29.16 Let y be an infinite cardinal number. If y is a singular cardinal then the 
cardinal number §, is a singular cardinal. 


Definition 29.17 A regular cardinal number which a limit cardinal is called an inaccessible 
cardinal. A regular cardinal number which is a strong limit cardinal is called a strongly 
inaccessible cardinal. 


Definition 29.18 Let N, be an infinite cardinal. We say the cofinality of X} is 6 and write 
cf(%) = 8 if 8 is the least ordinal in 8, which indexes an increasing set of ordinals 
{0a : a < 3} such that X, = lub{@, : a < 8}. If no such £ exists in Ny then we say 
the cofinality cf(&,) of is Ny and write cf(N,) = Ny. 
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Theorem 29.19 The cofinality cf(&,) of an infinite cardinal number N4 is a cardinal num- 
ber. Hence cf(€,) is the smallest cardinality of all sets which are cofinal in X4. 
Theorem 29.20 The cofinality cf(w) of an infinite cardinal number y¢ is a regular cardinal. 


Theorem 29.21 If x is an infinite cardinal and cf(K) < À, then k < kò. 


IX More on axioms: Choice, regularity and Martin’s axiom 30 / Axiom of choice 


Theorem 30.1 Suppose Z is a finite set of non-empty sets whose union is the set M. 
Then there exists a function f :: Z — M which maps each set to one of its elements. 


Theorem 30.2 Let AC* denote the statement: 
“For any set Y = {Sa : & E y} of non-empty sets, HacySa is non-empty.” 


The Axiom of choice holds true if and only if AC* holds true. The Axiom of choice 
holds true if and only if AC* holds true. 


Theorem 30.3 The statement “Every set is well-orderable” holds true if and only if the 
Axiom of choice holds true. 


Theorem 30.4 [AC] Any infinite set can be expressed as the union of a pairwise disjoint 
set of infinite countable sets. 


Theorem 30.5 [AC] Let (X,<) be a partially ordered set. If every chain of X has an 
upper bound then X has a maximal element. 


Theorem 30.6 Suppose that those partially ordered sets (X, <) in which every chain has 
an upper bound must have a maximal element. Then given any subset Z C Y(S)—@ 
there exists a choice function f : Z — S which maps each set in to one of its 
elements. 


Theorem 30.7 [ZL] Every vector space has a basis. 


IX More on axioms: Choice, regularity and Martin’s axiom 31 / Axiom of regu- 
larity and cumulative hierarchy. 


Theorem 31.1 The Axiom of regularity holds true if and only if every non-empty set S 
contains a minimal element with respect to the membership relation “e”. 


Theorem 31.2 [Axiom of regularity] No set is an element of itself. 


Definition 31.3 We say that a class is well-founded if it does not contain an infinite de- 
scending chain of sets. That is, there does not exist an infinite sequence {£n : n € w} 
such that ---€ £4 € £3 € £2 € £1 E€ Xo. 
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Theorem 31.4 [AC] The Aziom of regularity and the statement “Every set is well-founded” 
are equivalent statements. 


Definition 31.5 Let x be a set. The transitive closure of x is a set ty satisfying the fol- 
lowing three properties: 
1) The set t, is a transitive set. 
2) 1C tz 


3) tz is the C-least transitive set satisfying properties 1 and 2. 


Theorem 31.6 Let x be a set. Then there exists a smallest transitive set ty which contains 
all elements of x. That is, if s is a transitive set such that x C s, then x C tz C s. 


Definition 31.7 Define the class function f : Z > Z as f(S) = P(S). The elements of 
the class {Va : a € Ê} belong to the image of the class function g(a) = Vq recursively 
defined as follows: 


g0) = VW = @ = 0 
g1) = VY = fæ) = 2?(0)={Ø}=1 
D =v = fv) = 2(1)={{8},Øø}=2!=2 
9(3) = V3 = fV) = 2P(2)={{{{8}, of}, {{2}} {a}, 2} 
g4) = Va = fV) = P(A) (16 elements) 
gat) = Vat = fVa) = PWa) 
If \ = limit ordinal, g(A) = VY 2 Uac Va 


The class {Va : a € Ê} is called the Cumulative hierarchy of sets. We define V = 
Uac Va. 


Lemma 31.8 a) Each set Va is a transitive set. 
b) Ifa € 6 then Va € Vg. Hence Va C Vg. 


Lemma 31.9 For any non-empty set B, B C V implies BE V. 
Theorem 31.10 For every set xz, x € V = Uae Va- 


Theorem 31.11 Let V = UaegVa be the class of sets constructed as described above. If 
V contains all sets then every set has a €-minimal element. 


Theorem 31.12 Let V = UacoeVa be the class of sets constructed as described above. 


a) The rank of the empty set Ø is zero. 
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b) If U € Vg then rank(U) < 8; hence U ¢ Viank(u) for all sets U. Conversely, 
rank(U) < 8 >U € Vz. 


c) If U and V are sets such that U € V then rank(U) < rank(V). 
d) If y is an ordinal then rank(y) = y. 
Proposition 31.13 Let Ø be a limit ordinal. If a and b are two elements of Vg. Then 


a,b} is an element of Vg. That is, Vg satisfies the property described by the Axiom 
B B 
of pair. 


Proposition 31.14 Let 8 be any ordinal. If U is an element of Vg then U{a : x € U} € Vg. 
That is, Vg satisfies the property described by the Axiom of union. 


Proposition 31.15 Let @ be a limit ordinal. If U is an element of Vg then Vg also contains 
a set Y = A(U) such that S C U implies S € Y. That is, Vg satisfies the property 
described by the Axiom of power set. 


Proposition 31.16 Let œ be any ordinal. For any two elements x and y of Va, if for all 
z € Valz E€ x & z E€ y), then z and y are the same set. 


Proposition 31.17 Let œ be an ordinal number such that a > wọ. Then wo € Va. 


Proposition 31.18 Let œ be an ordinal number. Then the Axiom of subsets holds true in 
Va. 


Proposition 31.19 The set Vo, satisfies the property described by the Axiom of replace- 
ment. 


Proposition 31.20 Let y be a limit ordinal. Then the set V, satisfies the property de- 
scribed by the Axiom of choice. 


Proposition 31.21 Let œ be an ordinal. Then the set Va satisfies the property described 
by the Axiom of construction. 


IX More on axioms: Choice, regularity and Martin’s axiom 32 / Martin’s axiom. 


Definition 32.1 Let (P, <) be a partially ordered set. If P contains no uncountable strong 
antichain then (P, <) is said to satisfy the countable chain condition. In this case, we 
say that (P, <) satisfies the ccc or that (P, <) is a ccc partial order. 


Definition 32.2 Let (P, <) be a partially ordered set. Let D be a subset of P such that 
for every element p in P there exists an element d in D such that d < p. A subset D 
satisfying this property is said to be dense in the partial ordering (P, <). 
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Definition 32.3 Let F be a subset of a partially ordered set (P,<). If F is non-empty 
and satisfies the two properties, 1) If x and y belong to F there exists z in F which 
is less than or equal to both z and y (i.e., F is a filter base or downward directed), 2) 
if x belongs to F and = is less than or equal to an element y of P, then y belongs to 
F (i.e., F is upward closed). A filter in (P,<) is a proper filter if it is not all of P. 
If x € P then the set of all elements above z is called a principal filter with principal 
element x. Such a filter is the smallest filter which contains x. 


Theorem 32.4 MA(«): Let « be an infinite cardinal and (P,<) be a non-empty par- 
tially ordered set satisfying the countable chain condition. Let 9 = {D € A(P): 
D is dense in P} such that |2| < «. Then there is a proper filter F C P such that, 
FAD # Ø for every set D € 2. The statement MA(No) holds true in ZFC. 


Theorem 32.5 The statement MA(2*°) fails in ZFC. 


Definition 32.6 Martin’s axiom, MA, is defined as being MA(k) where « satisfies No < 
k < 20 


Theorem 32.7 [MA] Suppose « is an infinite cardinal such that « < 250, If X is a Haus- 
dorff compact space with ccc and {Ux : a < K} is a family of open dense subsets of 
X then N{Ua : a} £ Ø. 


X Ordinal numbers arithmetic 33 / Addition. 


Definition 33.1 Let (S,<,) and (T,<,) be two disjoint well-ordered sets. We define the 


>s 
relation “<ur on SUT as follows: 


a) u< 


—SUT 


b) u< 


— SUT 


c) u< 


— SUT 


v if {u,v} C S and u <, v. 
v if {u,v} C T and u <} v. 


vifuEe S, vET. 


Theorem 33.2 Let (S,<,) and (T, <,,) be two disjoint well-ordered sets. Then the relation 
<u well-orders the set SUT. 

Definition 33.3 Let a and 8 be two ordinal numbers. Let (S,<,) and (T, <) be two 
disjoint well-ordered sets of order type a and 8 respectively.| We define a + 8 as 
follows: 

a+8=™"(SUT, <sur) 

Theorem 33.4 Let (S, <), (T,<,) and (U,<,), (V,<,,) be two pairs of disjoint well- 

ordered sets such that 


1 Addition can also be defined inductively as follows: For all a and 8, a) 8 +0 = 6, b) 8 + (a+ 1) = 
(86 +a)+1, c) 8 +a =lub{8 +y: y <a} whenever a is a limit ordinal. 
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ord Ç =a= ord [7 

ord) = B = ord 7 
Then “4(SUT, <sur) =at+6 = **(UUV, <uuv). Hence addition of ordinal numbers 
is well-defined. 

Theorem 33.5 Let a, G and y be three ordinal numbers. Then: 


a) (a + B) + y = Q + (8 + y) (Addition is associative.) 


b) For any ordinal y >0,a <a +y 

c) For any ordinal y, y <a +y 

d) a<Bß=>=a+y<b+y 

e a<bß=y+a<y+6 

f) a+ 8B =a+y = B=Y (Left term cancellation is acceptable.) 
g) a+0=a 


Theorem 33.6 Let @ be a limit ordinal. Then, for any ordinal, a, 
a+b =sup{aty:7< p} 
X Ordinal numbers arithmetic 34 / Multiplication 


Definition 34.1 Let (S, <,) and (T,<,,) be two well-ordered sets. We define the lexico- 
graphic ordering on the Cartesian product S' x T as follows: 


S1 <s $2 
(s1, t1) Ssyr (S2, t2) provided or 
81 = 892 and ti S tə 


Theorem 34.2 Let (S,<,) and (T, <) be two well-ordered sets. The lexicographic or- 
dering of the Cartesian product S x T is a well-ordering. 


Theorem 34.3 Ifthe well-ordered sets S; and S2 are order isomorphic and the well-ordered 
sets Tı and Tọ are order isomorphic then the lexicographically ordered Cartesian 
products Sı x Tı and S2 x To are order isomorphic. 


Definition 34.4 Let a and 8 be two ordinals with set representatives A and B respectively. 
We define the multiplication a x 8 as: 


ax B =™{B x A) 


The product a x 8 is equivalently written as aß, (respecting the order). Note the 
order of the terms in the Cartesian product B x A is different from the order a x 3 
of their respective ordinalities. 
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Theorem 34.5 Let a, @ and y be three ordinal numbers. Then: 


a) (yB)a = y( Ba) (Multiplication is associative.) 

b) For any y>0,a< b = ya < 7 

c) yla + B) =Y&a+ yb (Left-hand distribution is acceptable.) 

d) For any y > 0, ya = yB = @= GB (Left-hand cancellation is acceptable.) 
e) yO=0 

f) For any limit ordinal 6 4 0, aß = sup {ay : y < 8} 


Definition 34.6 Let y be any non-zero ordinal. We define the y-based exponentiation 
function gy: @ — @ as follows: 


1) (0) =1 
2) glat) = gy(a)y 
3) gla) = lub{g,(8) : 8 < a} whenever a is a limit ordinal. 


Whenever  # 0 we represent g,(a) as y®. Then 7°t! = °y. If y = 0 we define 
“= 0% =0. 


Theorem 34.7 Let a, 6 and y be three ordinal numbers. Then, assuming y > 1, 


a<Bpey <7? 


Theorem 34.8 Let a, @ and y be three ordinal numbers where a Æ 0. 


unre 


b) (78) = a 


Appendix A / Boolean algebras and Martin’s axiom. 


Definition 0.1 A partially ordered set (P, <) is called a lattice if a V b = max{a, b} and 
a ^b = min{a, b} both exist in P for all pairs a, b in P. 


Definition 0.2 If B is a subset of a partially ordered set, (P,<), VB denotes the least 
upper bound of B and AB denotes the greatest lower bound of B (both with respect 
to <). Note that VB and AB may or may not be an element of B. A lattice (P, <) 
is said to be a complete lattice if for any non-empty subset B of P, both VB and AB 
exist and belong to P. 


Definition 0.3 Let X be a topological space. A subset B is said to be regular open in 
X if B = intx(clx(B)). The set of all regular open subsets of X will be denoted as 
Ko(X). 


438 List of definitions and statements. 


Theorem 0.4 Let X be a topological space. Then (%0(X), C,V,M) is a complete lattice 
in (7(X), C). 


Definition 0.5 Let (L,<,V,/) bea lattice. An L-ultrafilter is a proper filter F in L which 
is not properly contained in any other proper filter in L. If the filter F is such that 
{Ff : F € F} # @ then we say that the filter F is a fixed ultrafilter. Ultrafilters 
which are not fixed are said to be free ultrafilters. 


Theorem 0.6 Let X be a topological space. 


a) Suppose F is a proper L-filter where (L, C, V, A) is a lattice in (A(X), C). Then 
F can be extended to an L-ultrafilter. 


b) Suppose F is an L-filter in (L,C,V,A) a lattice in (A(X), C). Then F is an 
L-ultrafilter if and only if for every A C X, either A € F or X- AE F. 


Theorem 0.7 Let X be a topological space. Then (%0(X), C,V,M) is a complete lattice 
in (7(X),C). An £0(X)-filter, F, is an Zo(X)-ultrafilter if and only if, for any 
A € &o(X), either A or X — clx(A) belongs to F. 


Definition 0.8 A lattice (L, V, A^) is said to be a distributive lattice if, for any x, y, and z 
in L, x V (y Az) = (xV y)A^ (zV az) and ad (yV z) =(@Ay)V (aA 2). 


- The lattice (L, V,^) is said to be a complemented lattice it has a maximum 
element, denoted by 1, and a minimum element, denoted by 0, and for every 
x € L there exists a unique x’ such that x V x’ = 1 andzAa2’=0. 


- A complemented distributive lattice is referred to as being a Boolean algebra. 
A Boolean algebra is denoted as (B,<,V,/,0,1,’) when we explicitly want to 
express what the maximum and minimum elements are. 


Definition 0.9 Suppose we are given two lattices (B1, <1, V1, A1, 0, 1,’) and (B2, <2, V2, ^2, 
and a function f which maps elements of Bı to elements of By. We say that f : Bı > 
Bə is a Boolean homomorphism if, for any x,y € B, 


1) f(x Vi y) = f(x) V2 f(y), 
2) f(z My) = f(x) ^ f(y) 
3) f(x) = f(a)’. 


The function f : Bı —> Bə is a Boolean isomorphism if f is a bijection and both f 
and f“ are Boolean homomorphisms. 


Definition 0.10 Let (B,<,V,A,0,1,’) be a Boolean algebra. Let 7(B) = {1% : V isa 
B-ultrafilter}. We define the function fg : B —> Y(S(B)) as follows: falx) ={F € 
S(B): cE Yh. 
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Theorem 0.11 Let (B,<,V,A,0,1,’) be a Boolean algebra. Then the set {fp(x) : x € B} 
is a base for the open sets of some topology, 7(.7(B)), on the set Y(B) of all B- 
ultrafilters. 


Theorem 0.12 Let (B,<,V,A,0,1,’) be a Boolean algebra. 


1) The function fg : B — Y(.Y(B)) is a Boolean homomorphism mapping B into 
P(S(B)). 

2) For every x € B, fg(x) is clopen in .Y(B). Hence fg|B] C B(Y(B)) (the set of 
all clopen sets in Z (B)). 


3) The function fg : B — (B) is a Boolean isomorphism mapping B into 
B(S(B)) (the set of all clopen sets in Y(B)). 

4) The topological space (.7(B),7(.7(B))) where 7(.W(B)) is the topology gener- 
ated by the open base {fp (x) : x € B} is a compact zero-dimensional Hausdorff 
topological space. 


5) The Boolean isomorphism fg : B —> (B) maps B onto A(Z (B)) (the set of 
all clopen sets in Z (B)). 


Theorem 0.13 Let « be an infinite cardinal number such that x < 2%0. Then the following 
are equivalent: 


1) (Martin’s axiom MA) If (P, <) is a partially ordered set satisfying ccc and 9 = 
{Da : a < k} is a family of dense subsets of P, then there exists a filter F on P 
such that F N Da 4 Ø for each a < «x. 


2) If X is a compact Hausdorff topological space satisfying ccc and 2 = {Da :a< 
k} is a family of dense open subsets of X, then N{Da : Qa < Kk} Æ Ø. 


3) If (B,<,V,A, ’) isa Boolean algebra with the ccc property and 2 = {Da : a < k} 
is a family of dense subsets of B, then there exists a filter F on B such that 
F N Da # Ø for each a < kK. 


4) If (P, <) is a partially ordered set satisfying ccc and |P| < x and 2 = {Da : & < 
k} is a family of dense subsets of P, then there exist a filter F on P such that 
F N Da # Ø for each a < kK. 


Theorem 0.14 Let x be a cardinal such that No < k < 20. Let X be a Hausdorff 
topological space satisfying ccc such that {x € X : x has a compact neighbourhood} 
is dense in X. Suppose that J = {Da : a < r} is a family of dense open subsets of 
X. Then N{Da : a < k} is dense in X if and only if Martin’s axiom holds true. 
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